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INTRODUCTION 


STRING THEORY IS A MYSTERY. IT’S SUPPOSED TO BE THE THE- 
ory of everything. But it hasn’t been verified experimen- 
tally. And it’s so esoteric. It’s all about extra dimensions, 
quantum fluctuations, and black holes. How can that be the 
world? Why can’t everything be simpler? 

String theory is a mystery. Its practitioners (of which I am 
one) admit they don’t understand the theory. But calculation 
after calculation yields unexpectedly beautiful, connected 
results. One gets a sense of inevitability from studying string 
theory. How can this not be the world? How can such deep 
truths fail to connect to reality? 

String theory is a mystery. It draws many talented gradu- 
ate students away from other fascinating topics, like super- 
conductivity, that already have industrial applications. It 
attracts media attention like few other fields in science. And 
it has vociferous detractors who deplore the spread of its 
influence and dismiss its achievements as unrelated to em- 
pirical science. 

Briefly, the claim of string theory is that the fundamental 
objects that make up all matter are not particles, but strings. 
Strings are like little rubber bands, but very thin and very 
strong. An electron is supposed to be actually a string, vibrat- 
ing and rotating on a length scale too small for us to probe 
even with the most advanced particle accelerators to date. In 


some versions of string theory, an electron is a closed loop of 
string. In others, it is a segment of string, with two endpoints. 

Let’s take a brief tour of the historical development of 
string theory. 

String theory is sometimes described as a theory that was 
invented backwards. Backwards means that people had pieces 
of it quite well worked out without understanding the deep 
meaning of their results. First, in 1968, came a beautiful for- 
mula describing how strings bounce off one another. The 
formula was proposed before anyone realized that strings had 
anything to do with it. Math is funny that way. Formulas can 
sometimes be manipulated, checked, and extended without 
being deeply understood. Deep understanding did follow in 
this case, though, including the insight that string theory in- 
cluded gravity as described by the theory of general relativity. 

In the 1970s and early '80s, string theory teetered on the 
brink of oblivion. It didn’t seem to work for its original pur- 
pose, which was the description of nuclear forces. While it 
incorporated quantum mechanics, it seemed likely to have 
a subtle inconsistency called an anomaly. An example of an 
anomaly is that if there were particles similar to neutrinos, but 
electrically charged, then certain types of gravitational fields 
could spontaneously create electric charge. That’s bad because 
quantum mechanics needs the universe to maintain a strict 
balance between negative charges, like electrons, and positive 
charges, like protons. So it was a big relief when, in 1984, it 
was shown that string theory was free of anomalies. It was 
then perceived as a viable candidate to describe the universe. 

This apparently technical result started the “first super- 
string revolution”: a period of frantic activity and dramatic 
advances, which nevertheless fell short of its stated goal, to 
produce a theory of everything. I was a kid when it got going, 
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and I lived close to the Aspen Center for Physics, a hotbed of 
activity. I remember people muttering about whether super- 
string theory might be tested at the Superconducting Super 
Collider, and I wondered what was so super about it all. Well, 
superstrings are strings with the special property of supersym- 
metry. And what might supersymmetry be? I'll try to tell you 
more clearly later in this book, but for now, let's settle for two 
very partial statements. First: Supersymmetry relates particles 
with different spins. The spin of a particle is like the spin of a 
top, but unlike a top, a particle can never stop spinning. Sec- 
ond: Supersymmetric string theories are the string theories 
that we understand the best. Whereas non-supersymmetric 
string theories require 26 dimensions, supersymmetric ones 
only require ten. Naturally, one has to admit that even ten 
dimensions is six too many, because we perceive only three 
of space and one of time. Part of making string theory into a 
theory of the real world is somehow getting rid of those extra 
dimensions, or finding some useful role for them. 

For the rest of the 1980s, string theorists raced furiously 
to uncover the theory of everything. But they didn’t under- 
stand enough about string theory. It turns out that strings are 
not the whole story. The theory also requires the existence of 
branes: objects that extend in several dimensions. The sim- 
plest brane is a membrane. Like the surface of a drum, a 
membrane extends in two spatial dimensions. It is a surface 
that can vibrate. There are also 3-branes, which can fill the 
three dimensions of space that we experience and vibrate in 
the additional dimensions that string theory requires. There 
can also be 4-branes, 5-branes, and so on up to 9-branes. All 
of this starts to sound like a lot to swallow, but there are solid 
reasons to believe that you can't make sense of string theory 
without all these branes included. Some of these reasons have 
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to do with “string dualities.” A duality is a relation between 
two apparently different objects, or two apparently differ- 
ent viewpoints. A simplistic example is a checkerboard. One 
view is that it’s a red board with black squares. Another view 
is that it’s a black board with red squares. Both viewpoints 
(made suitably precise) provide an adequate description of 
what a checkerboard looks like. They’re different, but related 
under the interchange of red and black. 

The middle 1990s saw a second superstring revolution, 
based on the emerging understanding of string dualities and 
the role of branes. Again, efforts were made to parlay this 
new understanding into a theoretical framework that would 
qualify as a theory of everything. “Everything” here means 
all the aspects of fundamental physics we understand and 
have tested. Gravity is part of fundamental physics. So are 
electromagnetism and nuclear forces. So are the particles, 
like electrons, protons, and neutrons, from which all atoms 
are made. While string theory constructions are known that 
reproduce the broad outlines of what we know, there are 
some persistent difficulties in arriving at a fully viable theory. 
At the same time, the more we learn about string theory, the 
more we realize we don't know. So it seems like a third 
superstring revolution is needed. But there hasn’t been one 
yet. Instead, what is happening is that string theorists are 
trying to make do with their existing level of understanding 
to make partial statements about what string theory might 
say about experiments both current and imminent. The 
most vigorous efforts along these lines aim to connect string 
theory with high-energy collisions of protons or heavy ions. 
The connections we hope for will probably hinge on the 
ideas of supersymmetry, or extra dimensions, or black hole 
horizons, or maybe all three at once. 


INTRODUCTION 


5 


Now that we're up to the modern day, let's detour to con- 
sider the two types of collisions I just mentioned. 

Proton collisions will soon be the main focus of experi- 
mental high-energy physics, thanks to a big experimental fa- 
cility near Geneva called the Large Hadron Collider (LHC). 
The LHC will accelerate protons in counter-rotating beams 
and slam them together in head-on collisions near the speed 
of light. This type of collision is chaotic and uncontrolled. 
What experimentalists will look for is the rare event where 
a collision produces an extremely massive, unstable particle. 
One such particle—still hypothetical—is called the Higgs 
boson, and it is believed to be responsible for the mass of the 
electron. Supersymmetry predicts many other particles, and 
if they are discovered, it would be clear evidence that string 
theory is on the right track. There is also a remote possi- 
bility that proton-proton collisions will produce tiny black 
holes whose subsequent decay could be observed. 

In heavy ion collisions, a gold or lead atom is stripped of all 
its electrons and whirled around the same machine that carries 
out proton-proton collisions. When heavy ions collide head- 
on, it is even more chaotic than a proton-proton collision. It's 
believed that protons and neutrons melt into their constituent 
quarks and gluons. The quarks and gluons then form a fluid, 
which expands, cools, and eventually freezes back into the 
particles that are then observed by the detectors. This fluid 
is called the quark-gluon plasma. The connection with string 
theory hinges on comparing the quark-gluon plasma to a black 
hole. Strangely, the kind of black hole that could be dual to the 
quark-gluon plasma is not in the four dimensions of our every- 
day experience, but in a five-dimensional curved spacetime. 

It should be emphasized that string theory's connections to 
the real world are speculative. Supersymmetry might simply 


INTRODUCTION 


6 


not be there. The quark-gluon plasma produced at the LHC 
may really not behave much like a five-dimensional black 
hole. What is exciting is that string theorists are placing 
their bets, along with theorists of other stripes, and holding 
their breaths for experimental discoveries that may vindicate 
or shatter their hopes. 

This book builds up to some of the core ideas of modern 
string theory, including further discussion of its potential 
applications to collider physics. String theory rests on two 
foundations: quantum mechanics and the theory of relativ- 
ity. From those foundations it reaches out in a multitude of 
directions, and it’s hard to do justice to even a small fraction 
of them. The topics discussed in this book represent a slice 
across string theory that largely avoids its more mathemati- 
cal side. The choice of topics also reflects my preferences and 
prejudices, and probably even the limits of my understand- 
ing of the subject. 

Another choice I’ve made in writing this book is to dis- 
cuss physics but not physicists. That is, I’m going to do my 
best to tell you what string theory is about, but I’m not going 
to tell you about the people who figured it all out (although 
I will say up front that mostly it wasn’t me). To illustrate 
the difficulties of doing a proper job of attributing ideas to 
people, let’s start by asking who figured out relativity. It was 
Albert Einstein, right? Yes—but if we just stop with that 
one name, we're missing a lot. Hendrik Lorentz and Henri 
Poincaré did important work that predated Einstein; Her- 
mann Minkowski introduced a crucially important math- 
ematical framework; David Hilbert independently figured 
out a key building block of general relativity; and there are 
several more important early figures like James Clerk Max- 
well, George FitzGerald, and Joseph Larmor who deserve 
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mention, as well as later pioneers like John Wheeler and 
Subrahmanyan Chandrasekhar. The development of quan- 
tum mechanics is considerably more intricate, as there is no 
single figure like Einstein whose contributions tower above 
all others. Rather, there is a fascinating and heterogeneous 
group, including Max Planck, Einstein, Ernest Rutherford, 
Niels Bohr, Louis de Broglie, Werner Heisenberg, Erwin 
Schródinger, Paul Dirac, Wolfgang Pauli, Pascual Jordan, 
and John von Neumann, who contributed in essential 
ways—and sometimes famously disagreed with one another. 
It would be an even more ambitious project to properly as- 
sign credit for the vast swath of ideas that is string theory. 
My feeling is that an attempt to do so would actually de- 
tract from my primary aim, which is to convey the ideas 
themselves. 

The aim of the first three chapters of this book is to in- 
troduce ideas that are crucial to the understanding of string 
theory, but that are not properly part of it. These ideas— 
energy, quantum mechanics, and general relativity—are 
more important (so far) than string theory itself, because we 
know that they describe the real world. Chapter 4, where 
I introduce string theory, is thus a step into the unknown. 
While I attempt in chapters 4, 5, and 6 to make string the- 
ory, D-branes, and string dualities seem as reasonable and 
well motivated as I can, the fact remains that they are un- 
verified as descriptions of the real world. Chapters 7 and 
8 are devoted to modern attempts to relate string theory 
to experiments involving high-energy particle collisions. 
Supersymmetry, string dualities, and black holes in a fifth 
dimension all figure in string theorists’ attempts to under- 
stand what is happening, and what will happen, in particle 
accelerators. 


INTRODUCTION 


In various places in this book, I quote numerical values 
for physical quantities: things like the energy released in 
nuclear fission or the amount of time dilation experienced 
by an Olympic sprinter. Part of why I do this is that phys- 
ics is a quantitative science, where the numerical sizes of 
things matter. However, to a physicist, what’s usually most 
interesting is the approximate size, or order of magnitude, 
of a physical quantity. So, for example, I remark that the 
time dilation experienced by an Olympic sprinter is about a 
part in 10? even though a more precise estimate, based on a 
speed of 10 m/s, is a part in 1.8 x 10. Readers wishing to 
see more precise, explicit, and/or extended versions of the 
calculations I describe in the book can visit this website: 
http://press.princeton.edu/titles/9133.html. 

Where is string theory going? String theory promises to 
unify gravity and quantum mechanics. It promises to pro- 
vide a single theory encompassing all the forces of nature. It 
promises a new understanding of time, space, and additional 
dimensions as yet undiscovered. It promises to relate ideas as 
seemingly distant as black holes and the quark-gluon plasma. 
Truly it is a “promising” theory! 

How can string theorists ever deliver on the promise 
of their field? The fact is, much has been delivered. String 
theory does provide an elegant chain of reasoning starting 
with quantum mechanics and ending with general relativ- 
ity. ll describe the framework of this reasoning in chapter 
4. String theory does provide a provisional picture of how 
to describe all the forces of nature. I'll outline this picture in 
chapter 7 and tell you some of the difficulties with making it 
more precise. And as I'll explain in chapter 8, string theory 
calculations are already being compared to data from heavy 
ion collisions. 
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I don’t aim to settle any debates about string theory in 
this book, but I'll go so far as to say that I think a lot of the 
disagreement is about points of view. When a noteworthy 
result comes out of string theory, a proponent of the theory 
might say, “That was fantastic! But it would be so much bet- 
ter if only we could do thus-and-such.” At the same time, a 
critic might say, “That was pathetic! If only they had done 
thus-and-such, I might be impressed.” In the end, the pro- 
ponents and the critics (at least, the more serious and in- 
formed members of each camp) are not that far apart on 
matters of substance. Everyone agrees that there are some 
deep mysteries in fundamental physics. Nearly everyone 
agrees that string theorists have mounted serious attempts to 
solve them. And surely it can be agreed that much of string 
theory’s promise has yet to be delivered upon. 
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Chapter ON E 


ENERGY 


THE AIM OF THIS CHAPTER IS TO PRESENT THE MOST FAMOUS 
equation of physics: E = mc. This equation underlies nu- 
clear power and the atom bomb. It says that if you convert 
one pound of matter entirely into energy, you could keep 
the lights on in a million American households for a year. 
E = mč also underlies much of string theory. In particular, 
as we'll discuss in chapter 4, the mass of a vibrating string 
receives contributions from its vibrational energy. 

What's strange about the equation E = mc’ is that it relates 
things you usually don't think of as related. E is for energy, 
like the kilowatt-hours you pay your electric company for 
each month; m is for mass, like a pound of flour; c is for the 
speed of light, which is 299,792,458 meters per second, or 
(approximately) 186,282 miles per second. So the first task 
is to understand what physicists call “dimensionful quan- 
tities,” like length, mass, time, and speed. Then we'll get 
back to E = mc’ itself. Along the way, I'll introduce metric 
units, like meters and kilograms; scientific notation for big 
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numbers; and a bit of nuclear physics. Although it’s not nec- 
essary to understand nuclear physics in order to grasp string 
theory, it provides a good context for discussing E = mc’. 
And in chapter 8, I will come back and explain efforts to 
use string theory to better understand aspects of modern 
nuclear physics. 


Length, mass, time, and speed 


Length is the easiest of all dimensionful quantities. It’s what 
you measure with a ruler. Physicists generally insist on using 
the metric system, so I'll start doing that now. A meter is 
about 39.37 inches. A kilometer is 1000 meters, which is 
about 0.6214 miles. 

Time is regarded as an additional dimension by physicists. 
We perceive four dimensions total: three of space and one 
of time. Time is different from space. You can move any di- 
rection you want in space, but you can’t move backward in 
time. In fact, you can’t really “move” in time at all. Seconds 
tick by no matter what you do. At least, that’s our everyday 
experience. But it’s actually not that simple. If you run in a 
circle really fast while a friend stands still, time as you expe- 
rience it will go by less quickly. If you and your friend both 
wear stopwatches, yours will show less time elapsed than 
your friend's. This effect, called time dilation, is impercepti- 
bly small unless the speed with which you run is comparable 
to the speed of light. 

Mass measures an amount of matter. We're used to think- 
ing of mass as the same as weight, but it's not. Weight has 
to do with gravitational pull. If you're in outer space, you're 
weightless, but your mass hasn't changed. Most of the mass 
in everyday objects is in protons and neutrons, and a little bit 
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more is in electrons. Quoting the mass of an everyday object 
basically comes down to saying how many nucleons are in it. 
A nucleon is either a proton or a neutron. My mass is about 75 
kilograms. Rounding up a bit, that’s about 50,000,000,000, 
000,000,000,000,000,000 nucleons. It’s hard to keep track of 
such big numbers. There are so many digits that you can’t 
easily count them up. So people resort to what’s called scien- 
tific notation: instead of writing out all the digits like I did 
before, you would say that I have about 5 x 107? nucleons in 
me. The 28 means that there are 28 zeroes after the 5. Let's 
practice a bit more. A million could be written as 1 x 10°, or, 
more simply, as 10°. The U.S. national debt, currently about 
$10,000,000,000,000, can be conveniently expressed as 10? 
dollars. Now, if only I had a dime for every nucleon in me... 
Let's get back to dimensionful quantities in physics. Speed 
is a conversion factor between length and time. Suppose you 
can run 10 meters per second. That's fast for a person—really 
fast. In 10 seconds you can go 100 meters. You wouldn't win 
an Olympic gold with that time, but you'd be close. Suppose 
you could keep up your speed of 10 meters per second over 
any distance. How long would it take to go one kilometer? 
Let's work it out. One kilometer is ten times 100 meters. You 
can do the 100-meter dash in 10 seconds flat. So you can run 
a kilometer in 100 seconds. You could run a mile in 161 sec- 
onds, which is 2 minutes and 41 seconds. No one can do that, 
because no one can keep up a 10 m/s pace for that long. 
Suppose you could, though. Would you be able to no- 
tice the time dilation effect I described earlier? Not even 
close. Time would run a little slower for you while you were 
pounding out your 2:41 mile, but slower only by one part in 
about 10? (that’s a part in 1,000,000,000,000,000, or a thou- 
sand million million). In order to get a big effect, you would 
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have to be moving much, much faster. Particles whirling 
around modern accelerators experience tremendous time 
dilation. Time for them runs about 1000 times slower than 
for a proton at rest. The exact figure depends on the particle 
accelerator in question. 

The speed of light is an awkward conversion factor for 
everyday use because it’s so big. Light can go all the way 
around the equator of the Earth in about 0.1 seconds. That’s 
part of why an American can hold a conversation by telephone 
with someone in India and not notice much time lag. Light 
is more useful when you're thinking of really big distances. 
The distance to the moon is equivalent to about 1.3 seconds. 
You could say that the moon is 1.3 light-seconds away from 
us. The distance to the sun is about 500 light-seconds. 

A light-year is an even bigger distance: it's the distance 
that light travels in a year. The Milky Way is about 100,000 
light-years across. The known universe is about 14 billion 
light-years across. That's about 1.3 x 107° meters. 


E= mc 


The formula E= mé is a conversion between mass and en- 
ergy. It works a lot like the conversion between time and 
distance that we just discussed. But just what is energy? The 
question is hard to answer because there are so many forms 
of energy. Motion is energy. Electricity is energy. Heat is en- 
ergy. Light is energy. Any of these things can be converted 
into any other. For example, a lightbulb converts electricity 
into heat and light, and an electric generator converts mo- 
tion into electricity. A fundamental principle of physics is 
that total energy is conserved, even as its form may change. 
In order to make this principle meaningful, one has to have 
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ways of quantifying different forms of energy that can be 
converted into one another. 

A good place to start is the energy of motion, also called 
kinetic energy. The conversion formula is K = mv’, where 
K is the kinetic energy, m is the mass, and v is the speed. 
Imagine yourself again as an Olympic sprinter. Through a 
tremendous physical effort, you can get yourself going at 
v — 10 meters per second. But this is much slower than the 
speed of light. Consequently, your kinetic energy is much 
less than the energy E in E = mc. What does this mean? 

It helps to know that E = mc” describes “rest energy.” 
Rest energy is the energy in matter when it is not mov- 
ing. When you run, you're converting a little bit of your 
rest energy into kinetic energy. A very little bit, actually: 
roughly one part in 10". It's no accident that this same 
number, one part in 10", characterizes the amount of time 
dilation you experience when you run. Special relativity 
includes a precise relation between time dilation and ki- 
netic energy. It says, for example, that if something is mov- 
ing fast enough to double its energy, then its time runs half 
as fast as if it weren't moving. 

It's frustrating to think that you have all this rest energy 
in you, and all you can call up with your best efforts is a tiny 
fraction, one part in 10?. How might we call up a greater 
fraction of the rest energy in matter? The best answer we 
know of is nuclear energy. 

Our understanding of nuclear energy rests squarely on 
E = mc’. Here is a brief synopsis. Atomic nuclei are made up 
of protons and neutrons. A hydrogen nucleus is just a proton. 
A helium nucleus comprises two protons and two neutrons, 
bound tightly together. What I mean by tightly bound is 
that it takes a lot of energy to split a helium nucleus. Some 
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nuclei are much easier to split. An example is uranium-235, 
which is made of 92 protons and 143 neutrons. It is quite 
easy to break a uranium -235 nucleus into several pieces. For 
instance, if you hit a uranium-235 nucleus with a neutron, 
it can split into a krypton nucleus, a barium nucleus, three 
neutrons, and energy. This is an example of fission. We could 
write the reaction briefly as 


U+n— Kr + Ba + 3n + Energy, 


where we understand that U stands for uranium-235, Kr 
stands for krypton, Ba stands for barium, and n stands for 
neutron. (By the way, I'm careful always to say uranium-235 
because there's another type of uranium, made of 238 nucle- 
ons, that is far more common, and also harder to split.) 

E = mc’ allows you to calculate the amount of energy 
that is released in terms of the masses of all the participants 
in the fission reaction. It turns out that the ingredients (one 
uranium-235 nucleus plus one neutron) outweigh the prod- 
ucts (a krypton atom, a barium atom, and three neutrons) 
by about a fifth of the mass of a proton. It is this tiny incre- 
ment of mass that we feed into E — mc to determine the 
amount of energy released. Tiny as it seems, a fifth of the 
mass of a proton is almost a tenth of a percent of the mass of 
a uranium-235 atom: one part in a thousand. So the energy 
released is about a thousandth of the rest energy in a ura- 
nium-235 nucleus. This still may not seem like much, but 
it's roughly a trillion times bigger as a fraction of rest energy 
than the fraction that an Olympic sprinter can call up in the 
form of kinetic energy. 

I still haven't explained where the energy released in nu- 
clear fission comes from. The number of nucleons doesnt 
change: there are 236 of them before and after fission. And 
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yet the ingredients have more mass than the products. So this 
is an important exception to the rule that mass is essentially 
a count of nucleons. The point is that the nucleons in the 
krypton and barium nuclei are bound more tightly than they 
were in the uranium-235 nucleus. Tighter binding means 
less mass. The loosely bound uranium-235 nucleus has a little 
extra mass, just waiting to be released as energy. To put it in 
a nutshell: Nuclear fission releases energy as protons and neu- 
trons settle into slightly more compact arrangements. 

One of the projects of modern nuclear physics is to fig- 
ure out what happens when heavy nuclei like uranium-235 
undergo far more violent reactions than the fission reaction 
I described. For reasons I won't go into, experimentalists 
prefer to work with gold instead of uranium. When two 
gold nuclei are slammed into one another at nearly the speed 
of light, they are utterly destroyed. Almost all the nucleons 
break up. In chapter 8, I will tell you more about the dense, 
hot state of matter that forms in such a reaction. 

In summary, E — mc says that the amount of rest energy 
in something depends only on its mass, because the speed 
of light is a known constant. It's easier to get some of that 
energy out of uranium-235 than most other forms of mat- 
ter. But fundamentally, rest energy is in all forms of matter 
equally: rocks, air, water, trees, and people. 

Before going on to quantum mechanics, let's pause to 
put E = mc’ in a broader intellectual context. It is part of 
special relativity, which is the study of how motion affects 
measurements of time and space. Special relativity is sub- 
sumed in general relativity, which also encompasses gravity 
and curved spacetime. String theory subsumes both general 
relativity and quantum mechanics. In particular, string the- 
ory includes the relation E = mc’. Strings, branes, and black 
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holes all obey this relation. For example, in chapter 5 [ll 
discuss how the mass of a brane can receive contributions 
from thermal energy on the brane. It wouldn't be right to say 
that E = mc follows from string theory. But it fits, seemingly 
inextricably, with other aspects of string theory’s mathemati- 
cal framework. 


CHAPTER ONE 


Chapter T WO 


QUANTUM MECHANICS 


AFTER I GOT MY BACHELOR’S DEGREE IN PHYSICS, I SPENT A 
year at Cambridge University studying math and physics. 
Cambridge is a place of green lawns and grey skies, with an 
immense, historical weight of genteel scholarship. I was a 
member of St. John’s College, which is about five hundred 
years old. I particularly remember playing a fine piano lo- 
cated in one of the upper floors of the first court—one of 
the oldest bits of the college. Among the pieces I played was 
Chopin’s Fantasie-Impromptu. The main section has a per- 
sistent four-against-three cross rhythm. Both hands play in 
even tempo, but you play four notes with the right hand for 
every three notes in the left hand. The combination gives 
the composition an ethereal, liquid sound. 

It’s a beautiful piece of music. And it makes me think 
about quantum mechanics. To explain why, I will introduce 
some concepts of quantum mechanics, but I won't try to 
explain them completely. Instead, I will try to explain how 
they combine into a structure that is, to me, reminiscent of 
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music like the Fantasie-Impromptu. In quantum mechanics, 
every motion is possible, but there are some that are pre- 
ferred. These preferred motions are called quantum states. 
They have definite frequencies. A frequency is the number 
of times per second that something cycles or repeats. In the 
Fantasie-Impromptu, the patterns of the right hand have a 
faster frequency, and the patterns ofthe left hand have a slower 
frequency, in the ratio four to three. In quantum systems, the 
thing that is cycling is more abstract: technically, it's the phase 
of the wave function. You can think ofthe phase of the wave 
function as similar to the second hand of a clock. The second 
hand goes around and around, once per minute. The phase 
is doing the same thing, cycling around at some much faster 
frequency. This rapid cycling characterizes the energy of the 
system in a way that I'll discuss in more detail later. 

Simple quantum systems, like the hydrogen atom, have 
frequencies that stand in simple ratios with one another. For 
example, the phase of one quantum state might cycle nine 
times while another cycles four times. That's a lot like the 
four-against-three cross rhythm of the Fantasie-Impromptu. 
But the frequencies in quantum mechanics are usually a lot 
faster. For example, in a hydrogen atom, characteristic fre- 
quencies are on the scale of 10"? oscillations or cycles per sec- 
ond. That’s indeed a lot faster than the Fantasie-Impromptu, 
in which the right hand plays about 12 notes per second. 

The rhythmic fascination of the Fantasie-Impromptu 
is hardly its greatest charm—at least, not when it's played 
rather better than I ever could. Its melody floats above a 
melancholy bass. The notes run together in a chromatic blur. 
The harmonies shift slowly, contrasting with the almost des- 
ultory flitting of the main theme. The subtle four-against- 
three rhythm provides just the backdrop for one of Chopin's 
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more memorable compositions. Quantum mechanics is 
like this. Its underlying graininess, with quantum states at 
definite frequencies, blurs at larger scales into the colorful, 
intricate world of our experience. Those quantum frequen- 
cies leave an indelible mark on that world: for example, the 
orange light from a street lamp has a definite frequency, 
associated with a particular cross rhythm in sodium atoms. 
The frequency of the light is what makes it orange. 

In the rest of this chapter, I’m going to focus on three 
aspects of quantum mechanics: the uncertainty principle, 
the hydrogen atom, and the photon. Along the way, we'll 
encounter energy in its new quantum mechanical guise, 
closely related to frequency. Analogies with music are apt 
for those aspects of quantum mechanics having to do with 
frequency. But as we'll see in the next section, quantum 
physics incorporates some other key ideas that are less read- 
ily compared with everyday experience. 


Uncertainty 


One of the cornerstones of quantum mechanics is the un- 
certainty principle. It says that a particle's position and mo- 
mentum can never be simultaneously measured. That's an 
oversimplification, so let me try to do better. Any measure- 
ment of position will have some uncertainty, call it Ax (pro- 
nounced “Delta x"). For instance, if you measure the length 
of a piece of wood with a tape measure, you can usually get 
it right to within 1/32 of an inch if you're careful. That's a 
little less than a millimeter. So for such a measurement, one 
would say Ax © 1 mm: that is, “Delta x (the uncertainty) is 
approximately one millimeter." Despite the Greek letter A, 
the concept here is simple: A carpenter might call out to his 
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buddy, “Jim, this board is within a millimeter of two meters 
long.” (Of course, Im referring to a European carpenter, 
since the guys I've seen in the United States prefer feet and 
inches. What the carpenter means is that the length of the 
board is x = 2 meters, with an uncertainty Ax © 1 mm. 

Momentum is familiar from everyday experience, but 
to be precise about it, it helps to consider collisions. If two 
things collide head-on and the impact stops them both com- 
pletely, then they had equal momentum before the collision. 
If after the collision one is still moving in the direction it 
started, but slower, then it had larger momentum. There's a 
conversion formula from mass m to momentum p: p = mv. 
But let's not worry about the details just yet. The point is that 
momentum is something you can measure, and the measure- 
ment has some uncertainty, which we'll call Ap. 

The uncertainty principle says Ap x Ax > h/4n, where h is 
a quantity called Planck’s constant and x = 3.14159 . . . is the 
familiar ratio of the circumference of a circle to its diameter. 
I would read this formula aloud as “Delta p times Delta x is 
no less than h over 4 Pi.” Or, if you prefer, “The product 
of the uncertainties in a particles momentum and position 
is never less than Planck’s constant divided by 4 Pi.” Now 
you can see why my original statement of the uncertainty 
principle was an oversimplification: You can measure posi- 
tion and momentum simultaneously, but the uncertainty in 
those two measurements can never be smaller than what the 
equation Ap x Ax > h/4n allows. 

To understand an application of the uncertainty principle, 
think of capturing a particle in a trap whose size is Ax. The 
position of the particle is known within an uncertainty Ax if 
it is in the trap. The uncertainty principle then says is that it’s 
impossible to know the momentum of the trapped particle 
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more precisely than a certain bound. Quantitatively, the un- 
certainty in the momentum, Ap, has to be large enough so 
that the inequality Ap x Ax > h/4r is satisfied. Atoms provide 
an example of all this, as we'll see in the next section. It's hard 
to give a more everyday example, because typical uncertain- 
ties Ax are much smaller than objects that you can hold in 
your hand. That’s because Planck’s constant is numerically 
very small. We'll encounter it again when we discuss pho- 
tons, and ll tell you then what its numerical value actually is. 

The way you usually talk about the uncertainty princi- 
ple is to discuss measurements of position and momentum. 
But it goes deeper than that. It is an intrinsic limitation on 
what position and momentum mean. Ultimately, position 
and momentum are not numbers. They are more compli- 
cated objects called operators, which I won’t try to describe 
except to say that they are perfectly precise mathematical 
constructions—just more complicated than numbers. The 
uncertainty principle arises from the difference between 
numbers and operators. The quantity Ax is not just the un- 
certainty of a measurement; it is the irreducible uncertainty 
of the particle’s position. What the uncertainty principle 
captures is not a lack of knowledge, but a fundamental fuzz- 
iness of the subatomic world. 


The atom 


The atom is made up of electrons moving around the atomic 
nucleus. As we've already discussed, the nucleus is made up 
of protons and neutrons. The simplest case to start with is 
hydrogen, where the nucleus is just a proton, and there's 
only one electron moving around it. The size of an atom is 
roughly 10? meters, also known as an angstrom. (Saying that 
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an angstrom is 10" meters means that a meter is 10'°, or ten 
billion, angstroms.) The size of a nucleus is about a hundred 
thousand times smaller. When one says that an atom is about 
an angstrom across, it means the electron rarely goes further 
away from the nucleus than this. The uncertainty Ax in the 
position of the electron is about an angstrom, because from 
instant to instant it’s impossible to say on which side of the 
nucleus the electron will find itself. The uncertainty principle 
then says that there’s an uncertainty Ap in the momentum of 
the electron, satisfying Ap x Ax > h/4n. The way this comes 
about is that the electron in the hydrogen atom has some 
average speed—about a hundredth of the speed of light—but 
which direction it is moving in changes from moment to 
moment and is fundamentally uncertain. The uncertainty in 
the momentum of the electron is essentially the momentum 
itself, because of this uncertainty in direction. The overall 
picture is that the electron is trapped by its attraction to the 
nucleus, but quantum mechanics does not permit the elec- 
tron to rest in this trap. Instead, it wanders ceaselessly in a 
way that the mathematics of quantum mechanics describes. 
This insistent wandering is what gives the atom its size. If 
the electron were permitted to sit still, it would do so inside 
the nucleus, because it is attracted to the nucleus. Matter it- 
self would then collapse to the density of the nucleus, which 
would be very uncomfortable! So the quantum wanderings 
of the electrons inside atoms are really a blessing. 

Although the electron in a hydrogen atom has an uncer- 
tain position and an uncertain momentum, it has a definite 
energy. Actually, it has several possible energies. The way 
physicists describe the situation is to say that the electron’s en- 
ergy is “quantized.” That means that it has to choose among 
a definite set of possibilities. To appreciate this strange state 
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of affairs, let’s go back to kinetic energy in an everyday ex- 
ample. We learned about the conversion formula, K = mv’. 
Let’s say we apply it to a car. By giving the car more and 
more gas, you can pick out whatever speed v that you want. 
However, if energy were quantized for a car, you wouldn’t be 
able to do this. For example, you might be able to go 10 miles 
per hour, or 15, or 25, but not 11 or 12 or 12.5 miles per hour. 
The quantized energy levels of the electron in hydrogen 
bring me back to analogies with music. I already intro- 
duced one such analogy: the cross-rhythms in the Fantasie- 
Impromptu. A steady rhythm is itself a frequency. Each 
quantized energy level in hydrogen corresponds to a different 
frequency. An electron can pick one of these levels. If it does, 
that's like having a single steady rhythm, like a metronome. 
But an electron can also choose to be partly in one energy 
level and partly in another. That's called a superposition. The 
Fantasie-Impromptu is a "superposition" of two different 
rhythms, one carried by the right hand and one by the left. 
So far, l've told you that electrons in atoms have quantum 
mechanically uncertain position and momentum, but quan- 
tized energies. Isn't it strange that energies should be fixed 
to definite values when position and momentum cannot be 
fixed? To understand how this comes about, let's detour into 
another analogy with music. Think ofa piano string. When 
struck, it vibrates with a definite frequency, or pitch. For 
example, A above middle C on a piano vibrates 440 times in 
a second. Often, physicists quote frequencies in terms of the 
hertz (abbreviated Hz), which is one cycle or oscillation per 
second. So A above middle C has frequency 440 Hz. That's 
much faster than the rhythms of the Fantasie-Impromptu, 
where, if you recall, the right hand plays about 12 notes in a 
second: a frequency of 12 Hz. But it’s still much, much slower 
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than a hydrogen atom’s frequencies. Actually, the motion of 
the string is more complicated than a single vibration. There 
are overtones at higher frequencies. These overtones give a 
piano its characteristic sound. 

This may seem distant from the quantum mechanical mo- 
tion of an electron in a hydrogen atom. But in fact, it’s closely 
related. The lowest energy of an electron in hydrogen is like 
the fundamental frequency of a piano string: 440Hz for A 
above middle C. Oversimplifying a little, the frequency ofan 
electron in its lowest energy state is about 3 x 10'°Hz. The 
other energies that are possible for an electron are like the 
overtones of a piano string. 

The waves on a piano string and the quantum mechanical 
motions of an electron in a hydrogen atom are examples of 
standing waves. A standing wave is a vibration that doesn’t 
go anywhere. The piano string is tied down at the ends, so 
its vibrations are confined to its length. The quantum me- 
chanical motions of an electron in a hydrogen atom are con- 
fined to a much smaller space, little more than an angstrom 
across. The main idea behind the mathematics of quantum 
mechanics is to treat the motions of the electron as a wave. 
When the wave has a definite frequency, like the fundamen- 
tal frequency of a piano string, the electron has a definite 
energy. But the position of the electron is never a definite 
number, because the wave describing it is everywhere in the 
atom at once, just as the vibration of a piano string is a vibra- 
tion of the whole string at once. All you can say is that the 
electron is almost always within an angstrom of the nucleus. 

Having learned that electrons are described by waves, you 
might ask: Waves on what? This is a hard question. One an- 
swer is that it doesn’t seem to matter. Another is that there is 
an “electron field” permeating all of spacetime, and electrons 
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1A = 10m 


electron 


classical hydrogen atom quantum hydrogen atom 


Left: The classical picture of a hydrogen atom, where an electron orbits 
around a proton. Right: The quantum picture in terms of standing waves. 
Instead of following a definite orbit, the electron is represented as a standing 
wave. Itis not at a definite position, but it does have a definite energy. 


are excitations of it. The electron field is like the piano string, 
and the electron is like a vibration of the piano string. 
Waves are not always confined to small spaces like the 
inside of an atom. For example, waves on the sea can travel 
many miles before breaking on the shore. There are exam- 
ples of traveling waves in quantum mechanics: photons, for 
instance. But before we delve into photons, there’s a techni- 
cality which I have to discuss because it relates to things that 
will come up again in later chapters. I quoted a frequency for 
an electron in hydrogen, and I remarked that I was oversim- 
plifying. To explain how I was oversimplifying, lll introduce 
one more formula: E = hv. Here E is energy, v is frequency, 
and h the same Planck’s constant that came up in the uncer- 
tainty principle. E = hv is a wonderful formula, because it 
tells you what frequency really means: it’s just energy in a 
new guise. But here’s the trouble: There are different kinds 
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of energy. The electron has rest energy. It has kinetic energy. 
And it has binding energy, which is the amount of energy it 
takes to knock the electron free of the proton. Which should 
you use in the formula E = hv? When I quoted the figure 
3 x 10? oscillations per second for hydrogen, I was using the 
kinetic energy plus the binding energy, with the rest energy 
excluded. But this was arbitrary. I could have included the 
rest energy if I felt like it. That means that frequency has 
some ambiguity in quantum mechanics, which seems awful. 

Here’s how the difficulties get resolved. You can ask what 
happens when an electron jumps from one energy level to 
another. If the electron jumps down in energy, then it sheds 
the excess energy by emitting a photon. The photon’s en- 
ergy is the difference between the electron’s energy before 
it jumped and after. Now it doesn’t matter whether you in- 
clude the rest energy of the electron, because all you care 
about is the energy difference before and after the electron 
jumped levels. The proper use of the formula E = hv is 
to set E equal to the energy of the photon. Then v is the 
frequency of the photon, which is a definite number with 
no ambiguities. There’s just one thing left to settle: Exactly 
what does the frequency of a photon mean? This is what I 
want to explain next. 


The photon 


For centuries, a debate raged in physics. Is light a particle, or 
is it a wave? Quantum mechanics resolved the debate in an 
unexpected way: it’s both. 

To appreciate the wavelike characteristics of light, imag- 
ine an electron who decides to go sunbathing in a laser 
beam. A laser beam is a steady, coherent, intense beam of 
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light. Here’s the key point: When the electron steps into 
the laser beam, it pulls him first to one side, then the other, 
back and forth at some frequency. That frequency is the one 
that enters into the equation E = hv. Visible light has a fre- 
quency somewhat less than 10? oscillations per second. 
This analogy is fanciful, but it's easy to give a much more 
practical example. Radio waves are really the same thing as 
light, just with a much smaller frequency. FM radio waves 
have a frequency of about 10° oscillations per second, or 
10*Hz. One of the most popular stations where I live is 
New Jersey 101.5, which broadcasts at 101.5 megahertz. One 
megahertz is a million hertz, or 10* Hz. So 100 megahertz is 
10* Hz. Thus 101.5 megahertz is just a smidgen over 10° oscil- 
lations per second. An FM radio is constructed so that elec- 
trons inside it can oscillate at just about this frequency. When 
you tune a radio, what you are adjusting is the frequency 
at which the electrons inside its circuitry prefer to oscillate. 
Much like our sunbathing electron, the electrons inside the 
radio soak up the radio waves washing over the radio. 
Another analogy that might help is a buoy in the ocean. 
Typically, a buoy is attached by a chain to an anchor at the 
bottom of the ocean so that it doesn't get washed away by 
the waves and currents. The way it responds to waves is to 
bob up and down, staying at the surface of the water. This 
1s similar to the way the sunbathing electron responds to the 
laser beam. There's actually more to the story ofthe sunbath- 
ing electron: eventually it gets pushed in the direction of the 
laser beam, unless it is somehow tied down like the buoy. 
So far, my explanation has focused on the wavelike prop- 
erties of light. In what ways does it behave like a particle? 
There's a famous phenomenon called the photoelectric ef- 
fect which provides evidence that light really is composed 
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of photons, each with energy E = hv. Here’s how it works. 
If you shine light on a metal, you knock electrons out of it. 
With a clever experimental apparatus, you can detect those 
electrons and even measure their energy. The results of such 
measurements are consistent with the following explana- 
tion. Light, composed of many photons, delivers a series of 
little kicks to the metal. Each kick arises when a photon hits 
one of the electrons in the metal. Sometimes, if the photon 
has enough energy, it can kick the electron it hits clean out 
of the metal. According to the equation E = hv, higher 
frequency means higher energy. It’s known that blue light 
has a frequency approximately 35% higher than red light. 
That means a blue photon has 35% more energy than a red 
photon. Suppose you use sodium to study the photoelectric 
effect. It so happens that red photons aren’t quite energetic 
enough to knock electrons out of sodium. You don’t see 
any electrons getting knocked out even if you make the 
red light quite bright. But blue photons, with their extra 
increment of energy, do have enough energy to knock elec- 
trons out of sodium. Even a very faint blue ight will do the 
trick. What matters is not the brightness—which is related 
to how many photons there are—but the color of the light, 
which determines the energy of each photon. 

The minimum frequency of light it takes to kick electrons 
out of sodium is 5.5 x10" oscillations per second, which 
means green light. The corresponding energy, according to 
the equation E = hv, is 2.3 electron volts. An electron volt 
is the amount of energy a single electron acquires when it 
is pushed on by a one-volt power supply. So the numeri- 
cal value of Planck's constant is 2.3 electron volts divided 
by 5.5 x10" oscillations per second. That's usually quoted 
briefly as 4.1 x 10 ? electron-volt-seconds. 
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In summary, light behaves like a wave in many circum- 
stances, and like a particle in others. This is called wave- 
particle duality. According to quantum mechanics, it’s not 
just light that exhibits wave-particle duality: everything does. 

Let’s return to the hydrogen atom for a moment. I tried to 
explain in the last section how its quantized energy levels can 
be thought of as standing waves with definite frequencies. 
That’s an example of how electrons behave like waves. But 
if you remember, I got hung up on how to explain just what 
the frequency meant. I introduced the formula E = hv, but 
then I ran into trouble on the question of whether to include 
the rest energy of the electron in E. With photons, there's no 
such difficulty. The frequency of light really means some- 
thing tangible. It's the frequency that you tune a radio to 
receive. So when an electron jumps from one energy level to 
another, emitting a single photon in the process, you can use 
the frequency of the photon to judge unambiguously what 
the energy difference is between the two levels. 


I hope that the discussion so far has given you a fairly good 
feeling for what photons are. Understanding them com- 
pletely is remarkably difficult. The difficulties hinge on a 
concept called gauge symmetry, which I'll discuss at some 
length in chapter 5. In the rest of this section, let's explore 
how photons weave together concepts from special relativity 
and quantum mechanics. 

The theory of relativity is based on the assumption that 
light in a vacuum always goes at the same speed (299,792,458 
meters per second), and that nothing can go faster. Everyone 
who contemplates these claims is eventually struck by the 
thought that if you accelerated yourself to the speed of light 
and then fired a pistol in the direction you were moving, the 
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bullet would then be moving faster than the speed of light. 
Right? Not so fast. The trouble is connected with time dila- 
tion. Remember how I remarked that time ran 1000 times 
slower for particles in modern particle accelerators? It’s be- 
cause they're moving close to the speed of light. If, instead 
of moving close to the speed of light, you move at the speed 
of light, then time stops completely. You'd never fire that 
pistol, because you'd never have time to pull the trigger. 

It might seem that this argument leaves a little wiggle 
room. Maybe you could get yourself to within 10 m/s of the 
speed of light. Time would be moving as slow as molasses for 
you, but eventually you could squeeze off a shot from your 
pistol. When you did, the bullet would be moving relative to 
you quite a bit faster than 10 m/s, so surely it would exceed 
the speed of light. Right? Well, it just never works out that 
way. The faster you're going, the harder it is to get anything 
moving faster than you are. It's not because there's some kind 
of headwind blowing in your face: this could all be happening 
in outer space. It's because of the way time, length, and speed 
get tangled up in special relativity. Everything in relativity 
is cooked up in just such a way as to frustrate any attempt to 
go faster than light. Given the many successes of relativity in 
describing the world, most physicists are inclined to accept its 
main claim at face value: You just can't go faster than light. 

Now, what about the additional claim that light always 
goes at the same speed in a vacuum? This claim can be tested 
experimentally, and it seems to be true, no matter what fre- 
quency of light you use. This means that there is quite a stark 
contrast between photons and other particles, like electrons 
or protons. Electrons and protons can be fast or slow. If they're 
fast, they have lots of energy. If they're slow, they have less 
energy. But an electron by itself never has less energy than its 
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rest energy, E = mc’. Likewise the energy of a proton by itself 
is never less than its rest energy. A photon’s energy, however, 
is E = hv, and the frequency v can be as big or as small as we 
like without changing the speed of the photon. In particular, 
there’s no lower limit on the energy of a photon. What that 
must mean is that the rest energy of a photon is zero. If we 
use E = mc’, then we conclude that a photon’s mass must 
be zero. That’s the crucial difference between a photon and 
most other particles: a photon has no mass. 

It won't matter to future discussions in this book, but 
it's nice to know that it's only in a vacuum that light has a 
fixed speed. Light in fact does slow down when it's passing 
through matter. What I have in mind now is a very differ- 
ent situation than visible light hitting sodium: instead, I’m 
thinking of light passing through a transparent substance 
like water or glass. When passing through water, light slows 
down by a factor of about 1.33. When it's passing through 
glass, it can slow down by more, but never as much as a fac- 
tor of 2. Diamond slows light down by a factor of 2.4. This 
large factor, together with the clarity of diamond, gives it its 
unique, winking brilliance. 
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GRAVITY AND BLACK HOLES 


ONE FINE SUMMER DAY SOME YEARS AGO, I DROVE WITH MY 
father up to Grotto Wall, a popular climbing crag near Aspen, 
Colorado. We aimed to climb a classic moderate route called 
Twin Cracks. After we finished it without incident, I trotted 
out another idea: aid-climbing a harder route called Cryogen- 
ics. Aid-climbing means putting pieces of hardware into the 
rock that support your weight, instead of holding on with your 
hands and feet. You tie yourself to a rope, and you clip the rope 
to all the hardware you place, so that if the piece you're stand- 
ing on pulls out, the ones below it will catch your fall. 

Cryogenics was a perfect place to practice aid-climbing, I 
thought, because it was mostly overhanging. If you fell, you 
wouldn't slide painfully down the rock; instead, you would 
fall a short way and then dangle from the rope. Or you might 
fall until you hit the ground— but that possibility didn't seem 
too likely. The other good thing about Cryogenics, I thought, 
was that it had a crack a couple of fingers wide going most of 
the way up, so I could place as much gear as I wanted. 
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My Dad was agreeable, so I racked up and hopped on the 
route. Only then did I realize that my plan had some draw- 
backs. The rock wasn’t great inside the crack. It took a lot of 
gear, but it was not easy for me to get a really bomb-proof piece 
in. And though it was a short pitch, it really ate up the gear, so 
that when I got near the top, I was running seriously short of 
the most effective pieces. The last bit was the hardest to free- 
climb, and I had almost no gear left. But I was almost there! I 
placed a marginal nut in a flaring crack. I stepped up on it, and it 
held. I placed a hex in the same crack. I stepped up on the hex, it 
pulled out, and I fell. What happened next passed in a flash, and 
I don’t remember it, but it’s easy enough to reconstruct. 

The nut pulled out. I fell into empty space. The next nut 
pulled out. Climbers call this phenomenon “zippering,” be- 
cause it’s like unzipping a zipper. If enough pieces pull out, 
then you hit the ground. Every time a piece pulls, the one 
below it has to withstand a harder pull, because you have 
gathered more speed and more momentum. With a jerk, I 
came up short on the next piece. It was a cam, which is the 
most sophisticated piece of hardware in a climber’s arsenal. 
It was not ideally placed, but it held. My Dad, who was 
holding the rope while seated on the ground below, skidded 
forward as the rope between us came taut. 

And that was it. I spent some time studying the cam that 
held my fall. It seemed to have pulled and turned a little, but 
it was still OK. I improved a couple of the placements of pieces 
just below it, then lowered off the climb. I walked around for a 
few minutes thinking about how hard the ground was. I went 
back up the rope, recovered most of my gear, and called it a day. 

What can we learn from my experience on Cryogenics? 
Well, the first thing is this: when aid-climbing, you should 
stop when you run out of gear. 
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The second thing is that falling is not a problem. Landing 
is the problem. I walked away without a scratch because I 
didn’t hit the ground. (I did get a nosebleed a few minutes 
later.) Coming up short on that cam felt like a jerk, but it 
was a mild sort of jerk compared to the awful, crunching 
impact of hitting the ground. 

There’s a profound lesson about gravity that you can learn 
from falling. While you're doing it, you feel no gravity. You 
feel weightless. You get a similar feeling, to a lesser degree, 
when an elevator starts going down. Id like to tell you that 
I have some sort of deeper appreciation for gravity based on 
my intimate personal experience with falling. The fact is, on 
Cryogenics, either I didn't have time to appreciate the experi- 
ence, or I was too freaked out for rational thought to kick in. 


Black holes 


What would it be like to fall into a black hole? Would there 
be an awful, crunching impact? Or would you just fall for- 
ever? Let's take a quick tour of the properties of black holes 
to find out the answers. 

First of all, a black hole is an object from which no light 
can escape. "Black" is meant to convey the total darkness of 
such an object. The surface of a black hole is called its ho- 
rizon, because someone outside the horizon can't see what 
happens inside. That's because seeing involves light, and no 
light can get out of a black hole. Black holes are believed to 
exist at the centers of most galaxies. They are also thought 
to be the final stage of evolution of very massive stars. 

The strangest thing of all about black holes is that they are 
just empty space, except for a "singularity" at their center. 
This might seem like nonsense: how can the most massive 
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100,000 light-years V. 


Our galaxy, the Milky Way, probably has a black hole at its center. The 
mass of the black hole is thought to be about four million times the 
mass of the sun. From Earth’s perspective, it lies in the direction of the 
Sagittarius constellation. It’s about 26,000 light-years away from us. 
The black hole is much smaller than the size depicted here, and so is the 
region around it empty of stars. 


object in the galaxy be empty space? The answer is that 
all the mass inside the black hole collapses into the singu- 
larity. We actually don’t understand what happens right at 
the singularity. What we do understand is that the singular- 
ity distorts spacetime in such a way that there is a horizon 
surrounding it. Anything that gets within the horizon will 
eventually be drawn into the singularity. 
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Imagine a rock climber unfortunate enough to fall into 
a black hole. Crossing the horizon wouldn’t cause him any 
injury, because there’s nothing there: it’s empty space. The 
rock climber probably wouldn’t even notice when he fell 
through the horizon. The trouble is, nothing could then 
stop his fall. First of all, there’s nothing to hold onto— 
remember, it’s all empty space inside a black hole except 
for the singularity. The climber’s only hope is his rope. But 
even if the rope were clipped to the most bomb-proof piece 
of gear in history, it wouldn’t do any good. The gear might 
hold, but the rope would break, or it would stretch and 
stretch until the climber hit the singularity. When that hap- 
pened, presumably there really would be an awful, crunch- 
ing impact. But it’s hard to know for sure, because no one 
could observe it except the climber himself. That’s because 
no light can get out of a black hole! 

The main thing to take away from this discussion is that 
gravity’s pull inside a black hole is absolutely irresistible. Once 
he passed through the horizon, our unfortunate rock climber 
could no more stop his fall than he could stop time. And yet, 
nothing would “hurt” until he hit the singularity. Up until 
then, all he’d be doing is falling in empty space. He would 
feel weightless, as I must have felt while falling off of Cryo- 
genics. This highlights a fundamental premise of general rela- 
tivity: A freely falling observer feels like he’s in empty space. 

Here’s another analogy that might help. Imagine a lake 
in the mountains, drained by a small, swift stream. Fish in 
the lake know not to venture too close to the mouth of that 
stream, because if they start down the stream, it’s impossible 
for them to swim fast enough to escape the current and get 
back up into the lake. Unwise fish who let themselves be 
drawn into the stream don’t get hurt (at least not right away), 
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The black hole horizon is the point of no return. A spaceship can go 
close to it and then turn around and escape. But if the spaceship goes 
inside the horizon, it can never get back out. 


but they have no choice but to keep on being drawn down- 
stream. The lake is like spacetime outside a black hole, and 
the interior of the black hole is like the stream. The singular- 
ity would be like some sharp rocks upon which the stream 
breaks, and upon which any fish in the stream would come to 
an immediate and sanguinary end. You might imagine other 
possibilities: for example, maybe the stream leads to another 
lake which a fish can reach in perfect safety and comfort. 
Likewise, maybe there isn't a singularity inside a black hole 
after all, but instead a tunnel to another universe. That seems 
a bit far-fetched, but given that we don't understand singu- 
larities and can't find out what's inside a black hole except by 
falling into one, it can't be ruled out completely. 

In an astrophysical setting, there's an important caveat to 
the idea that you feel nothing as you approach a black hole 
and then cross the horizon. The caveat has to do with tidal 
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forces. Tidal forces are so named because of how ocean tides 
arise. The moon pulls on the earth more strongly on the 
side closest to it. The sea rises on that side in response to 
the moon’s pull. The sea also rises on the opposite side of 
the earth, which might seem pretty counterintuitive. But 
think of it this way: the middle of the earth is getting pulled 
toward the moon more than the oceans on the side opposite 
to the moon. Those oceans rise because they’re getting left 
behind. Everything else moves toward the moon more than 
they do, because everything else is closer to the moon, and 
more affected by it. 

When an object like a star gets close to a black hole, there’s 
a similar effect. The parts of the star closer to the black hole 
are pulled in more strongly, and the star gets elongated as a 
result. As the star approaches the black hole horizon, it even- 
tually gets shredded to bits. This shredding involves both 
tidal forces and the star’s rotational motion around the black 
hole. In order to avoid unnecessary complications, let’s ig- 
nore rotation and just think about a star falling straight into 
the black hole. Let’s simplify further by replacing the star by 
two freely falling observers, initially separated by a distance 
equal to the diameter of the star. What I have in mind is that 
the trajectories of these two observers are supposed to be 
similar to the trajectories of the parts of the star nearest and 
farthest from the black hole. I'll refer to the observer who 
starts off closer to the black hole as the near-side observer. 
The other one is the far-side observer. The black hole pulls 
more strongly on the near-side observer, just because she’s 
closer. So she starts falling faster than the far-side observer, 
with the result that the observers wind up getting farther 
apart as they fall. From their point of view, there appears to 
be a force pulling them apart. That apparent force 1s the tidal 
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force, which is simply an expression of the fact that at any 
given time, gravity pulls harder on the near-side observer 
than on the far-side observer. 

It may help to consider another analogy. Imagine a line of 
cars that start all bunched up on slow traffic. When the first 
car gets to a place where it can speed up, it pulls away from 
the car behind it. Even when the second car speeds up in the 
same place, it will remain a greater distance behind the first 
one. That’s very similar to the way the distance between the 
near-side and far-side observers increases when they start 
falling toward the black hole. The elongation of a star falling 
toward a black hole is essentially the same phenomenon— 
except that in order to give a fully realistic description, one 
would have to account for the rotational motion of the star 
around the black hole, and eventually also the peculiar dis- 
tortion of time near a black hole horizon. 

Modern experiments aim to detect events like stars fall- 
ing into black holes, or two black holes falling into one 
another. One of the key ideas is to detect the blast of gravi- 
tational radiation that occurs when the two massive objects 
are merging. Gravitational radiation is not something you 
can see with the naked eye, because what we see is light. 
Gravitational radiation is something completely different. It 
is a traveling wave of spacetime distortion. It carries energy, 
just like light does. It has a definite frequency, like light does. 
Light is composed of photons—little bits, or quanta, of light. 
And gravitational radiation, we think, is likewise composed 
of little quanta called gravitons. They obey the same relation 
E — hv between energy E and frequency v that photons do. 
They travel at the speed of light and are massless. 

Gravitons interact with matter much more weakly than 
photons do, so there is no hope of detecting them through 
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some analog of the photoelectric effect. Instead, the scheme 
for detection cuts directly to the fundamental nature of grav- 
itational radiation. When a gravity wave passes between two 
objects, the distance between them fluctuates. That’s because 
the spacetime between them itself fluctuates. The detection 
scheme, then, is to measure the distance between two objects 
very accurately and wait for it to fluctuate. If this scheme 
works, it will open up a whole new view of the universe. It 
will also be a spectacularly direct confirmation of the theory 
of relativity, which predicts gravitational radiation, whereas 
the previous Newtonian theory of gravity did not. 


The general theory of relativity 


Ive actually already told you a lot about general relativ- 
ity, indirectly. It’s the theory of spacetime that describes 
black holes and gravitational radiation. In general relativity, 
spacetime is not a static stage on which events occur, but a 
dynamic, curved geometry. Gravitational waves are ripples 
in this geometry, propagating like the ripples you get from 
throwing a stone into a lake. A black hole is like a stream 
draining the lake. Both analogies are imperfect. The main 
missing ingredient is a new version of time dilation that goes 
to the heart of general relativity. 

First, let me remind you about time dilation in special 
relativity. In special relativity, spacetime remains fixed. It’s 
all about how objects behave when they move relative to 
one another. Time dilation describes how time slows down 
when you've moving. The faster you move, the more time 
slows down. When you reach the speed of light, time stops. 

Here is the new feature of time dilation in general relativ- 
ity. Time slows down when you're deep down in a gravita- 
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tional well, like the one created by a massive star. When you 
get to a black hole horizon, time stops. 

But wait! I’ve previously claimed that there’s nothing spe- 
cial about a black hole horizon, except that once you fall in, 
you can't get out again. Crossing a horizon isn't a special 
experience. How can this be if time stops at a black hole 
horizon? The resolution is that time is a matter of perspec- 
tive. A rock climber falling through a horizon experiences 
time differently from the way you would if you hovered 
just a little bit above the horizon. An observer far from the 
black hole has yet a different experience of time. From the 
point of view ofa distant observer, it takes infinitely long for 
anything to fall into a horizon. If such an observer watched 
a rock climber fall into the black hole, it would seem like 
the climber crept ever closer to the horizon but never quite 
fellin. According to the climber’s own sense of time, it takes 
only a finite time for him to fall in, and only a finite addi- 
tional time to get to the center of the black hole where the 
singularity lurks. I would say that time slows down for the 
climber, because a second for him corresponds to a much 
longer time for the distant observer. Time also slows down 
for an observer who hovers slightly above the black hole. 
The closer he is to the horizon, the more time slows down. 

All this seems terribly abstract, but it has real-world con- 
sequences. Time runs slower at the surface of the Earth than 
it does in outer space. The difference is slight: it’s a little 
less than one part in a billion. But it matters for the Global 
Positioning System (GPS). The reason is that precision time 
measurements are part of what makes it possible for GPS to 
locate objects precisely on the surface of the Earth. Those 
time measurements suffer time dilation effects, both because 
the satellites are moving and because they’re not as far down 
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in Earth’s gravitational well as we are. Accounting properly 
for time dilation effects is a crucial ingredient in making 
GPS work as well as it does. 

I remarked earlier that there’s a connection between time 
dilation and kinetic energy. Let me remind you. Kinetic en- 
ergy is the energy of motion. Time dilation occurs when you 
are in motion. If you move so fast that you double your rest 
energy, then time runs halfas fast. If you move so fast that you 
quadruple your rest energy, then time runs a quarter as fast. 

There’s something very similar in the case of gravitational 
redshift, but it relates to gravitational energy. Gravitational 
energy is the amount of energy that you can gain by falling. 
Ifa piece of space debris falls to the Earth, the energy it gains 
by falling is a little less than a billionth of its rest mass. It’s no 
accident that this is the same tiny fraction that characterizes 
how much gravitational redshift there is on the surface of 
the Earth. Time running at different rates in different places 
is gravity. In fact, that’s all gravity is, provided gravitational 
fields are not too strong. Things fall from places where time 
runs faster to where it runs slower. That downward pull you 
feel, and which we call gravity, is just the differential rate of 
time between high places and low places. 


Black holes aren’t really black 


String theorists’ interest in black holes comes in large part from 
their quantum mechanical properties. Quantum mechanics 
turns the defining property of black holes on its head. No lon- 
ger are black hole horizons black. They glow like a live coal. 
But their glow is very faint, very cold—at least, if we're talk- 
ing about astrophysical black holes. The glow of a black hole 
horizon means that it has a temperature. That temperature is 
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related to how strong gravity is at the surface of the black hole. 
The larger a black hole is, the lower its temperature—at least, 
if we're talking about astrophysical black holes. 

Temperature is going to come up again, so we'd better 
discuss it more carefully. The right way to understand it is in 
terms of thermal energy, or heat. The heat in a mug of tea 
comes from the microscopic motion of the water molecules. 
When you cool water, you're sucking out thermal energy. 
Each water molecule moves less and less vigorously. Eventu- 
ally the water freezes into ice. That happens at zero degrees 
Celsius. But the water molecules in the ice are still moving a 
little: they vibrate around their equilibrium positions in the 
ice crystal. As you cool ice down further and further, these 
vibrations get weaker and weaker. Finally, at 2273.15 degrees 
Celsius, all such vibrations stop—almost: the water molecules 
are as fixed in their equilibrium positions as quantum uncer- 
tainty allows. You can't make something colder than —273.15 
degrees Celsius (which is —459.67 degrees Fahrenheit) because 
there’s no thermal energy left to suck out of it. This coldest of 
cold temperatures is called absolute zero temperature. 

It’s important that quantum mechanics prevents water 
molecules from ceasing to vibrate altogether, even at abso- 
lute zero temperature. Let’s explore this a little more. The 
uncertainty relation is Ap x Ax > h/4r. In an ice crystal, you 
know fairly precisely where each water molecule is. That 
means Ax is fairly small: certainly less than the distance be- 
tween neighboring water molecules. If Ax is fairly small, it 
means that Ap cannot be too small. So, according to quan- 
tum mechanics, the individual water molecules are still rat- 
tling around a little, even when they're frozen solid in a cube 
of ice at absolute zero. There is some energy associated with 
this motion, which goes by the name “quantum zero-point 
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energy." We've actually encountered it before when discuss- 
ing the hydrogen atom. If you recall, I compared the lowest 
energy of an electron in a hydrogen atom to the fundamen- 
tal frequency of a piano string. The electron is still moving. 
Both its position and its momentum have some uncertainty. 
Sometimes people describe this by saying that the electron 
undergoes quantum fluctuations. Its ground state energy 
can be termed the quantum zero-point energy. 

In summary, there are two types of vibrations happening 
in an ice cube: thermal vibrations and quantum fluctuations. 
You can get rid of the thermal vibrations by cooling the ice 
down to absolute zero. But you can't get rid of the quantum 
fluctuations. 

The idea of absolute zero temperature is so useful that 
physicists often quote temperatures in reference to it. This 
way of quoting temperature is called the Kelvin scale. One 
degree Kelvin—or, more commonly, one Kelvin—is one 
degree above absolute zero, or —272.15 degrees Celsius. 
273.15 Kelvin is 0 degrees Celsius, the temperature where 
ice melts. If you measure temperature on the Kelvin scale, 
then the typical energy of thermal vibrations is given by a 
simple equation: E = k,T, where k, is called Boltzmann's 
constant. For example, at the melting point of ice, this for- 
mula says that the typical energy of the thermal vibrations of 
a single water molecule is a fortieth of an electron volt. This 
is almost a hundred times smaller than the amount of energy 
it takes to knock an electron out of sodium, which, as you 
may recall from chapter 2, is 2.3 electron volts. 

Here are a few more interesting temperatures, just to get 
a feel for the Kelvin scale. Air turns into a liquid at about 
77 Kelvin, which is —321 degrees Fahrenheit. Room tem- 
perature (say, 72 degrees Fahrenheit) is about 295 Kelvin. 
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Physicists are able to cool small objects down to less than a 
thousandth of a Kelvin. On the other extreme, the surface 
of the sun is a little less than 6000 Kelvin, and the center of 
the sun is about 16 million Kelvin. 

Now, what does all this have to do with black holes? A 
black hole doesn’t seem to be made up of little molecules 
whose vibrations can be classified as thermal or quantum. 
Instead, a black hole is made up only of empty space, a hori- 
zon, and a singularity. It turns out that empty space is quite a 
complicated thing. It experiences quantum fluctuations that 
can be roughly described as spontaneous creation and de- 
struction of pairs of particles. If a pair of particles is created 
near a black hole horizon, then it can happen that one of the 
particles falls into the black hole and the other one escapes, 
carrying energy away from the black hole. This kind of pro- 
cess is what gives a black hole a non-zero temperature. To 
put it succinctly, a horizon converts some of the omnipres- 
ent quantum fluctuations of spacetime into thermal energy. 

Thermal radiation from a black hole is very faint, corre- 
sponding to a very low temperature. Consider, for example, 
a black hole formed in the gravitational collapse of a heavy 
star. It might contain a few times as much mass as the sun. 
Its temperature would then be about twenty billionths of a 
Kelvin, or 2 x 10° Kelvin. The black holes at the centers of 
most galaxies are much heavier: millions or even a billion 
times heavier than the sun. The temperature of a black hole 
five million times heavier than the sun would be about a 
hundredth of a trillionth of a Kelvin: that is, 10 "^ Kelvin. 

What fascinates string theorists is not so much the ex- 
treme lowness ofthe temperature of black hole horizons, but 
the possibility of describing certain objects in string theory, 
known as D-branes, as very small black holes. These very 
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small black holes can have a wide range of temperatures, 
from absolute zero to arbitrarily high values. String theory 
relates the temperature of small black holes to thermal vibra- 
tions on the D-branes. Ill introduce D-branes more care- 
fully in the next chapter, and I'll tell you more about how 
they relate to small black holes in chapter 5. This relation 
is at the heart of recent efforts to understand what happens 
in heavy ion collisions using string theory, as I'll discuss in 
chapter 8. 
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WHEN I WAS A SOPHOMORE AT PRINCETON, I TOOK A COURSE 
on Roman history. It was mostly about the Roman Repub- 
lic. It’s fascinating how the Romans combined peaceful and 
military achievements. They evolved an unwritten consti- 
tution and some degree of representative democracy while 
simultaneously overpowering first their neighbors, then the 
Italian peninsula, and finally the whole of the Mediterra- 
nean and beyond. Equally fascinating is how the civil strife 
of the late republic ended in the tyranny of the empire. 

Our language and legal system are filled with echoes of 
ancient Rome. For an example, look no further than the 
back of a quarter. If it was coined before 1999, then it shows 
an eagle perched on a bundle of sticks. This bundle is a fasces, 
a Roman symbol of strength and authority. The Romans 
also made influential contributions to literature, art, urban 
architecture and planning, and military tactics and strategy. 
The eventual adoption of Christianity in the Roman Em- 
pire helps account for Christianity’s prominence today. 
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As much as I enjoy Roman history, I wouldn't bring it up 
if it didn’t remind me of what I really want to talk about: 
string theory. We are deeply influenced by the Romans, but 
we are separated from them by a span of many centuries. 
String theory, if correct, describes physics at an energy scale 
far higher than we are able to probe directly. If we could 
probe the energy scales that string theory describes directly, 
then we would presumably see the various exotic things I 
am going to tell you about: extra dimensions, D-branes, du- 
alities, and so forth. This exotic physics underlies the world 
we experience (assuming string theory is correct), just as 
Roman civilization underlies our own. But string theory is 
separated from our experience of the world—not by centu- 
ries of time, but by a similarly vast gulfin energy scales. Par- 
ticle accelerators would have to be a hundred trillion times 
more powerful than the ones going into operation today to 
reach the range of energies where we think extra dimen- 
sions open up and stringy effects could be observed directly. 

This gulf in energy scales brings us to the most uncom- 
fortable aspect of string theory: it’s hard to test. In chapters 
7 and 8 [ll tell you about efforts to link string theory to 
experiment. In this chapter and the next two, I will instead 
try to convey string theory on its own terms, without reach- 
ing for connections to the real world except as explanatory 
devices. Think of these chapters as analogous to a brief sum- 
mary of Roman history. The narrative of Rome has many 
twists and turns. It's sometimes hard to follow. But we study 
the Romans to understand not only their world, but through 
it our own. String theory also has some surprising twists and 
turns, and I expect my explanations of them may not always 
be easy to get through. But there is at least a good chance 
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that a deep understanding of string theory will eventually 
be the basis of our understanding of the world. 

In this chapter, we'll take three important steps toward 
understanding string theory. The first is to understand how 
string theory resolves a fundamental tension between grav- 
ity and quantum mechanics. The second is to understand 
how strings vibrate and move in spacetime. The third is to 
get a glimpse of how spacetime itself emerges from the most 
widely used mathematical description of strings. 


Gravity versus quantum mechanics 


Quantum mechanics and general relativity are the great 
triumphs of early twentieth-century physics. But they turn 
out to be hard to reconcile with one another. The difficulty 
hinges on a concept called renormalizability. I will describe 
renormalizability by comparing photons and gravitons, both 
of which we've discussed in previous chapters. The upshot 
is going to be that photons lead to a renormalizable the- 
ory (which means, a good theory), whereas gravitons lead 
to a non-renormalizable theory—which is really no theory 
at all. 

Photons respond to electric charge, but they are not them- 
selves charged. For example, an electron in a hydrogen atom 
is charged, and when it jumps from one energy level to an- 
other, it emits a photon. That's what I mean by photons re- 
sponding to charge. Saying that photons are not themselves 
charged is the same as saying that light doesn't conduct elec- 
tricity. If it did, you'd get a shock from touching something 
that had been in the sun too long. Photons do not respond 
to one another because they only respond to electric charge. 
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Gravitons respond not to charge, but to mass and energy. 
Because they carry energy, they respond to themselves. They 
self-gravitate. This might not seem too problematic, but 
here’s how you get into trouble. Quantum mechanics teaches 
us that gravitons are particles as well as waves. A particle, by 
hypothesis, is a pointlike object. A pointlike graviton gravi- 
tates more strongly the closer you get to it. Its gravitational 
field can be understood as the emission of other gravitons. 
To keep track of all these gravitons, let’s call the original one 
the mother graviton. We'll refer to the gravitons it emits as 
daughter gravitons. The gravitational field very close to the 
mother graviton is very strong. That means that its daugh- 
ter gravitons have enormous energy and momentum. This 
follows from the uncertainty relation: daughter gravitons 
are observed within a very small distance Ax of the mother 
graviton, and so their momentum is uncertain by a large 
amount Ap, such that Ap x Ax > h/4n. The trouble is that 
gravitons also respond to momentum. The daughter gravi- 
tons will themselves emit gravitons. The whole process runs 
away: you can’t keep track of the effects of all these gravitons. 

Actually, there’s something similar that happens near an 
electron. If you probe the electric field very close to it, you 
provoke the electron into emitting photons with a large mo- 
mentum. That seems innocuous, because we’ve learned that 
photons can’t emit other photons. The trouble is, they can 
split into electrons and anti-electrons, which then emit more 
photons. What a mess! The amazing thing is that, in the case 
of electrons and photons, you actually can keep track of this 
multiplicity of particles cascading from one another. One 
speaks of a “dressed” electron to describe the electron and its 
cloud of progeny. The technical term for its progeny is vir- 
tual particles. Renormalization is the mathematical method 
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Left: An electron (e) produces virtual particles: photons (y), positrons 


(et), and more electrons. The cascade of particles is slow enough to keep 
track of mathematically, using renormalization. Right: A graviton (h) 
produces so many virtual gravitons so prolifically that renormalization 
can't keep track of them. 
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for keeping track of them all. The spirit of renormalization 
is that an electron all by itself may have infinite charge and 
infinite mass, but once it is dressed, its charge and mass be- 
come finite. 

The trouble with gravitons is that you can’t renormalize 
the cloud of virtual gravitons that surround them. Gen- 
eral relativity—the theory of gravity—is said to be non- 
renormalizable. This might seem like an arcane technical 
problem. There’s a faint chance that we’re just looking at 
the problem the wrong way. There’s also a chance, perhaps 
less faint, that a relative of general relativity, called maxi- 
mal supergravity, is renormalizable. But I, along with most 
string theorists, feel pretty sure that there is a fundamental 
diffculty merging quantum mechanics and gravity. 

Enter string theory. The starting assumption is that par- 
ticles are not pointlike. Instead they are vibrational modes 
of strings. A string is infinitely thin, but it has some length. 
That length is small: about 10 ?* meters, according to con- 
ventional ideas about string theory. Now, strings respond 
to one another in a fashion similar to gravitons. So you 
might worry that the whole problem with clouds of virtual 
particles—actually, virtual strings—would get out of con- 
trol, just as it did for gravitons. What stops this problem from 
arising is that strings aren't pointlike. The whole difficulty 
with gravity arose because point particles are, by assumption, 
infinitely small—hence the term “point particle.” Replacing 
gravitons with vibrating strings smoothes out the way they 
interact with one another. One way to say it is that when a 
graviton splits into two, you can identify an instant of time 
and a position in space where the split occurred. But when 
a string splits, it looks like a pipe branching. At the branch 
point, no part ofthe pipe wall is breached: the Y is a smooth, 
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string string 


string 


A graviton splits suddenly. A string splitting occurs over some region of 
spacetime, so it’s more gentle. 


solid piece of pipe, just in an unusual shape. What all this is 
leading up to is that the splitting of a string is a more gentle 
event than the splitting of a particle. Physicists say that string 
interactions are intrinsically "soft," whereas particle interac- 
tions are intrinsically "hard." It is this softness that makes 
string theory more well-behaved than general relativity, and 
more amenable to a quantum mechanical treatment. 


Strings in spacetime 


Let me remind you briefly of our earlier discussion of a vi- 
brating piano string. When you stretch it tightly between 
two pegs and pluck it, it vibrates with a definite frequency. 
The frequency is the number of vibrations per second. A 
piano string also has overtones of vibration: higher pitches 
that blend with its fundamental frequency to produce the 
particular sound you associate with a piano. I drew an anal- 
ogy to the behavior ofan electron in a hydrogen atom: it too 
has a preferred vibrational mode, corresponding to its lowest 
energy level, as well as other vibrational modes, correspond- 
ing to higher energy levels. This analogy might have left you 
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a little cold: what does an electron in a hydrogen atom really 
have to do with standing waves on a stretched string? It’s 
more like a particle rotating around the atomic nucleus—like 
an infinitesimally small planetoid rotating around a tiny little 
sun. Right? Well, yes and no: quantum mechanics says the 
particle picture and the wave picture are deeply related, and 
the quantum mechanical motion of the electron around the 
proton actually can be described as a standing wave. 

We can be much more direct in comparing a piano string 
with the strings of string theory. To distinguish between dif- 
ferent types of strings, let me call the strings of string theory 
“relativistic strings.” This term anticipates deep insights that 
we'll discuss soon, namely that strings incorporate the theo- 
ries of relativity, both special and general. For the moment, 
I want to talk about a string theory construction that is as 
close as I can manage to a stretched piano string. Relativistic 
strings are allowed to end on objects called D-branes. If we 
suppress the effects of string interactions, D-branes are infi- 
nitely heavy. We'll discuss them in a lot more detail in the 
next chapter, but for now they're just a crutch for our under- 
standing. The simplest of D-branes is called a DO-brane, usu- 
ally pronounced as “dee-zero brane.” It's a point particle. You 
might feel bothered by the fact that point particles just ap- 
peared again in the discussion. Wasn't string theory supposed 
to get rid of them? The fact is, it did for a while, and then 
in the middle 1990s point particles came back, along with a 
whole zoo of other things. But I'm getting ahead of the story. 
What I want is a string theory analog of the tuning pegs in 
a piano, and DO-branes are so appropriate that I can't resist 
introducing them. So, let's stretch a relativistic string between 
two DO-branes, like we'd stretch a piano string between two 
pegs. The DO-branes aren't attached to anything, but they 
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don't move because they are infinitely heavy. Pretty weird 
stuff, eh? I'll say more about DO-branes in the next chapter. 
What I really want to talk about here is the stretched string. 

The lowest energy level of the stretched string has no vi- 
brations. Well, almost none. There's always a little quantum 
mechanical vibration, and that's going to be important in a 
minute. The right way to understand the ground state is that it 
has as little vibrational energy as quantum mechanics allows. 
The relativistic string has excited states where it 1s vibrating, 
either at its fundamental frequency, or in one of its overtone 
frequencies. It can vibrate simultaneously in several different 
frequencies, just like a piano string does. But just as the elec- 
tron in a hydrogen atom cannot move in an arbitrary fashion, 
so too a relativistic string cannot vibrate arbitrarily. The elec- 
tron has to choose among a series of energy levels with finite 
spacing from one another. Similarly, the string has to choose 
among a series of vibrational states. The vibrational states have 
different energies. But energy and mass are related through 
E — mc. So the different vibrational states of a string have 
different masses. 

It would be nice if I could tell you that the vibrational 
frequencies of a string correspond in a simple way to its en- 
ergy, just as the equation E = hv connects the frequency and 
energy of a photon. There's something like that going on, 
but unfortunately it's not quite so simple. The total mass of a 
string comes from several different contributions. First, there's 
the rest mass of the string: the mass it has on account of being 
stretched from one DO-brane to the other. Next, there's the 
vibrational energy in each overtone. This contributes to the 
mass, because energy is mass according to E = mc’. Finally, 
there's a contribution coming from the minimal amount of 
vibration allowed by quantum uncertainty. This contribution 
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Motions of a string stretched between two DO-branes. 


from quantum fluctuations is called zero-point energy. The 
term "zero-point" is supposed to remind us that this quantum 
contribution can’t be gotten rid of. The contribution of the 
zero-point energy to the mass is negative. That's strange. Re- 
ally strange. To understand just how strange, consider the fol- 
lowing. If you restrict attention to a single vibrational mode of 
the string, the zero-point energy is positive. Higher overtones 
make bigger positive contributions to the zero-point energy. 
But when you sum them all up in an appropriate way, you 
get a negative number. If that isn't bad enough, here's worse 
news: I lied a little in saying that the contribution of zero- 


CHAPTER FOUR 


59 


point energy to the mass is negative. All these effects—rest 
mass, vibrational energies, and zero-point energy—add up to 
the total mass squared. So if the zero-point energy dominates, 
the mass squared is negative. That means the mass is imaginary, 
like VA. 

Before you dismiss all this as nonsense, let me hasten to add 
that a large swath of string theory is devoted to getting rid of 
the awful problem I described in the previous paragraph. To 
put this problem in a nutshell, a relativistic string in its least 
energetic quantum state has negative mass squared. A string 
in such a state is called a tachyon. Yes, those are the same 
tachyons that Star Trek characters confront in about every 
other episode. They're obviously bad news. In the setup I de- 
scribed, where the string is stretched between two DO-branes, 
you can get rid of them just by separating the DO-branes far 
enough so that the contribution to the mass from stretching 
the string is bigger than the quantum fluctuations. But when 
there aren't any DO-branes around, there are still strings. In- 
stead of ending on something, they close upon themselves. 
They arent stretched out at all. They can still vibrate, but they 
don't have to. The only thing they can't avoid doing is fluctu- 
ating quantum mechanically. And as before, those zero-point 
quantum fluctuations tend to make them tachyonic. This is 
bad, bad news for string theory. The modern view is that 
tachyons are an instability, similar to the instability ofa pencil 
balanced on its point. If you're extremely persistent and skill- 
ful, maybe you can balance a pencil that way. But the least 
breath of wind will knock it over. String theory with tachy- 
ons is kind of like a theory of the motion of a million pencils, 
distributed throughout space, all balanced on their points. 

Let me not paint things too black. There is a saving grace 
of tachyons. Let's accept that the ground state of a string is 
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a tachyon, with negative mass squared: m^ < 0. Vibrational 
energy makes rm less negative. In fact, if you play your cards 
right, the smallest increment of vibrational energy that 
quantum mechanics allows makes m exactly 0. That’s great, 
because we know there are massless particles in nature: pho- 
tons and gravitons. So if strings are to describe the world, 
there must be massless strings—more precisely, there must 
be vibrational quantum states of strings that are massless. 

I said something about playing your cards right. Exactly 
what does that mean? Well, it means that you need 26 di- 
mensions of spacetime. You probably knew this was coming, 
so I won't apologize for it. There are several arguments for 26 
dimensions, but most of them are pretty mathematical, and 
I'm afraid I couldn't make them sound at all convincing. The 
argument I have in mind hinges on the following points. 
You know you want massless string states. You know there 
are zero-point quantum fluctuations that push m^ negative. 
And you know there are vibrational modes that push m^ in 
the other direction. The smallest amount of vibrational en- 
ergy doesn't depend on the dimension of spacetime. But the 
zero-point quantum fluctuations do. Think of it this way: 
when something vibrates, like a piano string, it does so in a 
definite direction. A piano string vibrates in the direction in 
which it was struck. In a grand piano, that's up and down, 
not side to side. Vibrations pick out a direction and ignore 
all others. In contrast, zero-point quantum fluctuations go in 
every possible direction. Every new dimension you introduce 
gives the quantum fluctuations another direction to explore. 
More directions means more zero-point fluctuations, so a 
more negative contribution to m”. All that's left is to ask how 
things balance out between vibrations and the irreducible 
zero-point quantum fluctuations. It's a matter of calculation. 
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Cartoons of the quantum states of a string that make it act like a 
tachyon, or a photon, or a graviton. 


It turns out that the minimal amount of vibration cancels out 
against 26 dimensions’ worth of quantum fluctuation, lead- 
ing to massless string states as desired. Look on the bright 
side. It could have been 26 and a half dimensions. 

If you're getting confused between vibrations ofa string and 
the zero-point quantum fluctuations, don't be disappointed. 
They're really similar. The only difference is that vibrations 
are optional, and zero-point quantum fluctuations are not. 
Zero-point fluctuations are the minimum amount of vibra- 
tion required by uncertainty. Extra vibrations on top of that 
can still be quantum mechanical. It helps me to think of the 
vibrations as giving a string a characteristic shape: maybe 
circular, maybe cloverleaf, maybe folded over and spinning. 
Those different shapes are supposed to correspond to differ- 
ent particles. But speaking of the shape of a vibrating string 
is imprecise, because all the vibrations are quantum me- 
chanical. Better is to say that the different quantum modes 
of vibration of a string correspond to different particles. The 
shapes are mental pictures that help us visualize some of the 
properties of those quantum vibrations. 

To summarize, we've got good news, bad news, and 
worse news. Strings have vibrational modes and can act like 
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a photon or a graviton. That’s the good news. They can 
only do this in 26 dimensions. That’s the bad news. There’s 
also a vibrational mode of a string that has imaginary mass, 
the tachyon. It signals an instability of the whole theory. It 
doesn’t get much worse than that. 

Superstring theory cures the tachyon problem, and it low- 
ers the number of dimensions from 26 to 10. It also produces 
new vibrational modes that allow strings to act like electrons. 
Pretty cool stuff, all in all. If only there were a super-duper- 
string theory that cut the number of dimensions down to 4, 
maybe we'd be in business. I say this only half in jest. There 
actually is a version of super-duper-string theory, the more 
technical name for which is string theory with extended local 
supersymmetry. It cuts the number of dimensions down to 4. 
Unfortunately, those dimensions come in pairs, so you either 
get four spatial dimensions and no time, or two spatial dimen- 
sions and two times. Not good. We need three dimensions of 
space and one time. Of the ten dimensions that superstring 
theory requires, nine are spatial dimensions and one is time. 
To relate superstring theory to the world, somehow we have 
to do away with six of the nine spatial dimensions. 

There’s a lot I would like to tell you about superstrings, 
but most of it has to wait until later chapters. Let me focus 
here on a synopsis of how the tachyon problem gets cured. 
The superstring fluctuates not just in space and time but in 
other, more abstract ways. These other types of fluctuations 
go partway toward solving the tachyon problem, but not all 
the way. There’s still a vibrational mode with negative mass 
squared. The key to the story is that if you start with the vi- 
brational modes representing photons, electrons, and other 
particles that we want, no matter how you collide them, you 
can never make a tachyon. It feels like the whole theory is 
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still balanced on a knife edge. But it has a special symmetry 
that helps it stay balanced. That symmetry is called super- 
symmetry. Physicists hope to find evidence for supersym- 
metry in the next few years. If it is found, many of us would 
take it as confirmation of superstring theory. I'll discuss this 
more in chapter 7. 


Spacetime from strings 


I’ve talked a lot about strings vibrating and fluctuating in 
spacetime. Let’s take a step back and ask, just what is space? 
Just what is time? One view is that space derives meaning 
only through the objects present in space. What space de- 
scribes is the distance between objects. A similar view of 
time is that it’s meaningless by itself, but only describes the 
sequence of events. To make this more definite, consider a 
pair of particles, A and B. The conventional view is that each 
of them moves on some trajectory through spacetime, and 
they collide if the trajectories cross. Perhaps there’s nothing 
wrong with that. But let’s try to take the alternative view 
that space and time have no meaning in the absence of the 
particles. What would that mean? Well, to describe the tra- 
jectory of particle A, we could specify its position as a func- 
tion of time. And the same for particle B. If we could do 
that, then perhaps we could disregard space and time except 
as represented by the evolving position of the particles. We 
would still know if the particles collided, because they would 
have the same position and the same time when they hit. 

If this seems too abstract, think of the particles as racecars 
equipped with GPS devices and clocks. Let’s suppose the GPS 
devices record where the racecars are once a second. What 
could we learn from examining the records from the GPS 
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devices? Well, let’s suppose the racecars all move on the same 
racetrack. From the GPS records, the first thing you would 
notice is that the cars keep coming back to the same place after 
traveling a fixed distance—the distance around the track. So 
you would say, Aha! The cars travel on a circular track. Next, 
let’s say you notice that the cars speed up and slow down a lot. 
After scratching your head a bit, you might figure out that the 
racetrack isn’t circular after all! Instead it has curves, where 
the racecars have to slow down, and straightaways, where 
they can go fast. You might also observe that all the cars for 
which you have records go around the track in the same di- 
rection. You could correctly conclude that there’s a rule at the 
racetrack that everyone has to go the same way. Finally, you 
would notice that the cars have many near misses but seldom 
crash into one another. And you might reasonably deduce 
that a goal of car races is not to have collisions. 

The upshot is that just by looking at the GPS records of a 
number of racecars, and doing a lot of detective work, you 
could figure out quite a bit about the racetrack and the rules 
of driving on it. It may seem that this is a bass-ackwards 
way of finding things out that you could much more easily 
discover by watching an actual race. But in truth, watching 
a race is a very complicated activity. You're in a stand off to 
the side of the track—which already means that the race- 
track can't be all there is to spacetime. Watching means that 
photons are bouncing off the cars and going into your eyes, 
and that involves a lot of physics. It’s really a lot simpler to 
say that the GPS records of where all the cars were, second 
by second, contains the essential information about what 
happened on the racetrack. With such records in hand, you 
don’t have to ask about complicated things like observers in 
the stands and photons going hither and thither. You don’t 
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have to ask—in fact, you can’t meaningfully ask—whether 
there is anything in the world beyond the racetrack. You 
don’t even have to assume that the racetrack exists. Instead, 
you can deduce its existence and some of its properties by 
studying the records of how the cars moved. 

A lot of string theory is done in a similar way. From the 
way strings move and interact, you deduce properties of space 
and time. This approach is called worldsheet string theory. 
The worldsheet is a way of recording how a string moves. It 
is like the second-by-second GPS record of where a car is on 
a racetrack. It’s more complicated, though, for two reasons. 
First, a string can be long and floppy, so to say where it is, 
you have to say where every bit of it is. Second, as we've 
reviewed, strings usually live in 26 dimensions, or at least 
ten. These dimensions might be curved or rolled up in some 
complicated way. It is usually not possible to "stand to the 
side" and "look" at the geometry of spacetime the way you 
would watch cars at a racetrack. Meaningful questions are 
ones that can be phrased in terms of how strings move and 
interact. Spacetime itself, in the worldsheet approach, is only 
what the strings experience, rather than a fixed stage. 

The string worldsheet is a surface. If you cut across it, you 
get a curve, and that curve is supposed to be the string. Cut- 
ting across the worldsheet in different ways is like checking 
the GPS record of a car at different moments. In order to 
say how the string moves in spacetime, you have to specify 
a point in space and a moment in time for each point on the 
worldsheet. 'Think of it as attaching a whole bunch of labels 
to the worldsheet. When you cut across the worldsheet, the 
curve you get still has those labels, so it “knows” what shape 
in space it's supposed to assume. The worldsheet as a whole is 
the surface that the string sweeps out in spacetime as it moves. 
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Left: Two hills separated by a saddle. Right: a topographical map of the 
hills, with contours of constant elevation labeled. 


You can appreciate what I mean by labeling the world- 
sheet by thinking about a topographical map (topo for 
short). On a topo, there are lines of elevation, and each one 
1s labeled—or, if that's too many labels to read, sometimes 
people label one line in five. Now, the topo itself is a per- 
fectly flat piece of paper. But it represents terrain that can be 
quite hilly. 

One way to think of the string worldsheet is that it's like 
a topographical map of how the string is supposed to move 
in spacetime. But another point of view is that the string 
worldsheet is all there is, and spacetime is no more than the 
collection of labels you put on the worldsheet. In ordinary 
topographical maps, the labels are elevations, so the collec- 
tion of labels is just the range of possible elevations on the 
surface of the Earth: about —400 meters to 8800 meters if 
you exclude ocean floors. In worldsheet string theory, each 
label specifies a location in 26 dimensions (or ten in the case 
of superstrings). Some of those 26 dimensions can wrap back 
around and reconnect to themselves, like a racetrack does. 
The point is that the concept of spacetime emerges from 
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how you label the worldsheet, just as you could say that ele- 
vation “emerges” from the way you label topographic maps. 

Let’s recap and then get to one of the main punch lines of 
worldsheet string theory. Usually we think of strings vibrat- 
ing in spacetime. But space and time don’t have to be absolute 
notions. It’s better if they’re not, because then some outside 
dynamical principle can control the shape of spacetime. It 
happens that way in string theory. In the worldsheet approach 
to string theory, spacetime is just the list of labels allowed in a 
description of how the string moves. In a quantum mechani- 
cal treatment, these labels fluctuate a little. Now, here is the 
real punch line. It turns out that you can keep track of these 
quantum fluctuations only if the spacetime obeys the equa- 
tions of general relativity. General relativity is the modern 
theory of gravity. So quantum mechanics plus worldsheet 
string theory implies gravity. Pretty cool. 

Explaining how you “keep track” of the quantum fluc- 
tuations of spacetime labels on the string worldsheet would 
take us into excessively technical territory. But there is a 
point of contact with the racecar analogy that may help your 
intuition. If you recall, I suggested that you might guess 
that the racetrack has straightaways and curves by noticing 
that the racecars slow down to traverse certain parts of the 
track and then speed up in other parts. Well, one thing a 
racetrack almost never has is corners where you have to turn 
really suddenly. That’s because the cars would all have to 
stop at the corners, which would be no fun, and contrary to 
the spirit of a car race. Similarly, one of the things that the 
equations of general relativity almost entirely forbid is sharp 
corners in spacetime—usually called singularities. I say “al- 
most” because singularities are allowed behind black hole 
horizons. For the most part, you can understand the absence 
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of singularities in spacetime as analogous to the absence of 
corners on a racetrack. Strings can no more pass through 
most singularities than racecars can zip around a corner 
without stopping. But there are some exceptions. A fascinat- 
ing and large subject in string theory is understanding the 
types of singularities that are allowed. Usually these singu- 
larities cannot be understood in general relativity. So string 
theory actually allows a somewhat richer class of spacetime 
geometries than relativity does. It turns out that the extra 
geometries string theory allows are in some cases related to 
branes, which we will encounter in the next chapter. 
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IN 1989, AFTER MY JUNIOR YEAR IN HIGH SCHOOL, I WENT TO 
a physics camp. One of the things we did was to hear a lec- 
ture on string theory. About halfway through, one of the 
other students asked a sharp question. He said (more or less), 
“Why stop with strings? Why not work with sheets, or 
membranes, or solid three-dimensional chunks of quantum 
stuff?” The lecturer basically replied that strings seemed to 
be both difficult enough and powerful enough already, and 
that they seemed to be special in ways that membranes and 
solid chunks were not. 

Fast forward about six years, to 1995: The whole string 
theory community was electrified by the advent of D-branes. 
D-branes are exactly what the sharp student had asked for 
in 1989. They are objects in string theory that can have any 
number of dimensions. This chapter is mostly about D- 
branes and some of their amazing properties. I'll start with 
a brief account of the second superstring revolution, which 
was a tide of new ideas that swept through the field in the 
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mid-1990s. [ll tell you more precisely what a D-brane is, 
and Pll discuss the concept of symmetry and how it relates to 
D-branes. Next I’ll describe how D-branes relate to black 
holes. At the end, I'll get to some discussion of M-theory, 
which is an eleven-dimensional theory that is necessitated 
by string theory but not entirely part of it. 


The second superstring revolution 


The perspective on string theory that I presented in the previ- 
ous chapter is about what string theorists understood in 1989. 
They understood the danger of tachyons, the miraculous 
properties of the superstring, and the relationship between 
strings and spacetime. Another thing they understood, which 
I have scarcely mentioned, is compactification: the process of 
rolling up the six extra dimensions of superstring theory so 
as to be left with three dimensions of space and one of time. 
It all looked pretty good, because you had all the main in- 
gredients of fundamental physics. Gravity was there. Photons 
were there. Electrons and other particles were there. Inter- 
actions among them were about what one wanted. Clever 
compactifications seemed to give just about the correct list of 
particles—a list that stretches well beyond the ones I’ve men- 
tioned so far. But string theorists couldn't "close the deal" by 
producing a compactification that was just right, leading to 
precisely the physics that we observe in the real world. 
Looking back on that period, there was another problem. It 
was strings, strings, strings, all day every day. The understand- 
ing of the string worldsheet was profound, but the very depth 
of that understanding may have temporarily blinded people 
to the possibilities that eventually got explored in the second 
superstring revolution. It's hard for me to trace the history of 
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that period with complete accuracy, because I entered the field 
a little after the second revolution started. But it’s clear that 
hints started accumulating that strings weren't the whole story. 
Before starting a detailed discussion of branes, it seems to me 
worthwhile to summarize some of those hints and to give an 
overview of what the second superstring revolution was about. 

One hint was that interactions among strings became less 
and less controllable the more splitting and joining events 
there were. It was suggested that some new objects had to 
be added in order to handle string theory when splitting and 
joining interactions became strong. Another hint came from 
theories of supergravity. Supergravity is a low-energy limit 
of superstring theory. What I mean by a "low-energy limit" 
is that you throw away all but the lowest energy vibrational 
modes of the superstring. What you have left is the graviton 
and some other particles whose interactions are very precisely 
understood as long as they're not too energetic. It was ob- 
served that supergravity theories had some remarkable sym- 
metries that weren't visible in the worldsheet description of 
string theory. That seemed to indicate that the worldsheet 
description was incomplete. The broadest hint came from 
the construction of branes. A brane is like a string, but it can 
have any number of dimensions of spatial extent. A string is 
a 1-brane. A point particle is a O-brane. A membrane, which 
at any given instant of time is a surface, is a 2-brane. And 
there are 3-branes, 4-branes, 5-branes (two kinds!), 6-branes, 
7-branes, 8-branes, and 9-branes. With so many different 
branes present in string theory, it started to seem implau- 
sible that everything could be understood in terms of strings 
alone. A final hint came from eleven-dimensional super- 
gravity. It is a theory constructed using only two ideas: super- 
symmetry and general relativity. It has some connections to 
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form a closed loop. A 2-brane can close on itself to form a surface without 
boundary. A 3-brane can do something similar, but it’s hard to draw. 


the supergravity theories that come out of string theory, 
and those connections were understood well before the sec- 
ond superstring revolution. But it wasn’t at all clear how or 
whether it was related to worldsheet string theory. Worst of 
all, it didn’t incorporate quantum mechanics, and therefore 
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was viewed with skepticism by string theorists who were used 
to the idea that quantum mechanics and gravity are tightly 
intertwined. Eleven-dimensional supergravity was, in short, 
a mystery to string theorists: something close to what they 
were most interested in, but that didn’t entirely make sense. 
The field changed dramatically in a few short years in 
the middle 1990s, as these hints suddenly fell into a coher- 
ent pattern. Strings were still recognized as important, but 
it emerged that branes of various dimensions were also es- 
sential. At least in some circumstances, branes had to be 
placed on an equal footing with strings. In other circum- 
stances, branes could be described as zero-temperature black 
holes. Eleven-dimensional supergravity also fit beautifully 
into the new circle of ideas. It seemed so central, in fact, that 
it got a new name: M-theory. More properly, M-theory is 
whatever consistent quantum theory has eleven-dimensional 
supergravity as its low-energy limit. Sadly, the second super- 
string revolution fell short of giving a full description of what 
M-theory really is. What became clear, however, is that with 
the new toolbox that branes provide, one could understand 
string theory in a new way. Especially surprising was the real- 
ization that when string interactions are very strong, new ob- 
jects (often branes) offer simpler descriptions ofthe dynamics. 
Clearly, P've offered you only a brief survey of the ideas of 
the second superstring revolution. The rest of this chapter, 
and much of chapter 6, will be devoted to developing some 
of these ideas more fully. The best place to start is D-branes. 


D-branes and symmetries 


D-branes are a particular type of brane. Their defining 
property is that they are locations in space where strings 


BRANES 


74 


can end. It took a long time to realize that this simple idea 
can be developed into a remarkably rich understanding of 
how D-branes move and interact. D-branes have a definite 
mass which can be calculated starting just from the idea that 
strings can end on a D-brane. This mass becomes larger and 
larger when strings interact more and more weakly. A stan- 
dard working assumption in worldsheet string theory is that 
string interactions are very weak. The D-branes are then 
so massive that it’s hard to get them to move, and therefore 
hard to appreciate their role as dynamical objects in string 
theory. I suspect the prevalence of the assumption of weak 
string interactions prior to the second superstring revolu- 
tion was another reason that it took a while for D-branes 
to be recognized as dynamical objects in their own right. 

I introduced DO-branes in the previous chapter. They are 
point particles. D1-branes are like strings. They stretch out in 
one spatial dimension. They can close on themselves to form 
loops. And they can move in all sorts of ways, just like strings. 
That is, they can vibrate, and they have quantum fluctua- 
tions. A Dp-brane stretches out in p spatial dimensions. There 
are Dp-branes in 26-dimensional string theory, and also in 
ten-dimensional superstring theory. As I explained in chapter 
4, 26-dimensional string theory has an awful problem: the 
string tachyon, which is a kind of instability. A similar insta- 
bility afflicts D-branes in 26-dimensional string theory, but 
not in ten-dimensional superstring theory. Mostly, in the rest 
of this book, Pll be talking about superstring theory. 

A lot can be understood about D-branes by understand- 
ing their symmetries. I’ve used the word symmetry pretty 
freely so far. Now let me explain what physicists mean by 
this word. A circle is symmetrical. So is a square. But a circle 
is more symmetrical than a square. Here’s how I would jus- 
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Rotating a circle by any angle leaves it unchanged. Rotating a square by 
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90? leaves it unchanged, but rotating it by other angles does change it. 


tify that comparison. A square is the same if you turn it 90? 
degrees. A circle is the same no matter how you turn it. 
So there are more different ways of viewing the circle that 
make it look the same. That's what symmetry is all about. 
When something is the same when viewed from different 
angles, or in different ways, it has the quality of symmetry. 

Physicists (and mathematicians) are deeply attached to a 
slightly more abstract description of symmetry. The key con- 
cept is called a group, or a symmetry group. When you turn 
a circle, let's say by 90? to the right, it corresponds to an 
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“element” of the group. This “element” is a rotation by 90°. 
You don't have to be thinking of a circle to grasp the idea of a 
90° rotation. Think of it this way. Everyone understands the 
idea of a right turn. A right turn usually amounts to turn- 
ing 90° to the right. We can talk about right turns without 
discussing a particular intersection. We also understand that 
a left turn is the opposite of a right turn. If you start going 
north on 8th Avenue in Manhattan, make a right turn onto 
26th Street, and then a left onto 6th Avenue, your direction 
of travel is the same as it was when you started: north. Pl 
admit that not everything is the same. You're now driving on 
6th Avenue, whereas before you were on 8th. But suppose all 
you're trying to keep track ofis direction. Then, really, a right 
turn and a left turn cancel out, just like 1 and —1 add up to 0. 
There’s another thing you know about right turns and left 
turns—meaning turns by 90°. Three right turns amounts to 
a left turn. After four right turns, you're moving in the same 
direction you were going. This is very different from adding 
and subtracting numbers. Think of a right turn as a 1, and a 
left turn as a 1. Two right turns is 1 + 1 = 2. Two right turns 
and one left turn is 1 + 1 — 1 — 1, so the same as one right 
turn. So far so good. But four right turns is like not turning 
at all, which would suggest 1 + 1 + 1 + 1 = 0. Not good. 
This illustrates the difference between the "arithmetic" of 
right and left turns, and ordinary arithmetic. Mathematically, 
all there is to know about a group is how its elements add 
up. Well, not quite. You also have to know how to find the 
"inverse" ofa group element. The inverse ofa right turn is a 
left turn. Whatever a group element does, its inverse undoes. 
There's a certain similarity between this discussion and 
the one in chapter 4 about spacetime from strings. In that 
section, we started by thinking of the string worldsheet as an 
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abstract surface. Then we told it how to move in spacetime. 
Here we're thinking of a group as an abstract collection of 
elements. Then we decide how those group elements act on 
a particular object, like a circle, or a square, or a traveling car. 

I claim that the symmetry group of a square (more prop- 
erly, the group of rotational symmetries of a square) is the 
same one that describes right turns and left turns. A right 
turn means rotating by 90°. When you're driving, turning 
right also means that you go around a corner: you turn at 
the same time as moving forward. But as I’ve said, we're 
trying to keep track only of the direction you're facing, not 
your forward progress. If that's all we're thinking about, 
then this turning by 90? 1s just a rotation, as if we stopped in 
the middle of the intersection, turned the car by some magi- 
cal means, and then started going again. The point is that 
these 90? turns are exactly the ones that describe the rota- 
tional symmetries of a square. A circle is more symmetrical 
yet, because you can turn it by any angle and it's the same. 

Is there anything more symmetrical than a circle? Sure: 
a sphere. If you take a circle and turn it out of the plane in 
which it lies, clearly it's not the same. But a sphere is the 
same no matter how you turn it. It has a bigger group of 
symmetries than a circle. 

Now let’s get back to D-branes. It’s hard to keep track of 
ten or 26 dimensions, so let’s just imagine that we’ve some- 
how done away with all but the usual four. A D0-brane 
has the symmetries of a sphere. Any point particle does—at 
the level of our current discussion. The reason is just that a 
point looks the same from any angle, just like a sphere. D1- 
branes can have many shapes, but the simplest to visualize 
is when it is perfectly straight, like a flagpole. Then it has 
the symmetries of a circle. If that doesn’t make sense, think 
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of a D1-brane rising straight up out of a sidewalk. OK, that 
was silly—think of a flagpole in the middle of a sidewalk. 
You can't really rotate the flagpole: it's too heavy. But you 
can look at it from different sides. It looks the same from all 
angles. The same is true of a circle drawn on the sidewalk. 
You can't turn it, but you can look at it from every angle, 
and it's the same. 

Symmetry is an elaboration of the notion of sameness. 
So it may seem like it gets boring pretty quickly. Same old 
same old, eh? But there are a couple more elaborations that 
make it all seem a lot more exciting to me. First, think ofa 
turntable. (For people younger than the author, it may help 
to be reminded that a turntable is the part of a record player 
that you put a record on.) If it's a really good one, with no 
wobbles, it is hard to tell by looking whether it is on or off. 
That’s because it has the symmetries of a circle. But now 
imagine putting a record on it. You can tell that it's turn- 
ing now because the center label usually has some words 
printed on it. But let's ignore that for now. The grooves 
on the record are in a spiral pattern. If you look closely, 
you can see that spiral moving. It looks like each individual 
groove is moving slowly, slowly inward. If you put a needle 
on the record, it follows the grooves inward. If you jiggered 
the turntable so that it turned backward, the needle would 
move slowly outward. The point is that continuous turn- 
ing is not the same as staying still. We don’t really need the 
record to tell us this: it just helped illustrate that rotational 
motion can be detected in obvious ways or in subtle ones. 
We could have just said that the turntable is continuously 
undergoing rotation and left it at that. 

Particles like electrons and photons are forever rotating. 
The term physicists prefer 1s that they're spinning, like tops. 
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Electrons can spin in any direction they like: that is, the axis 
of their rotation can point in any direction. Physicists com- 
monly refer to the axis of rotation of a spinning electron 
as the direction of its spin. This axis of rotation can itself 
change over time, but it does so only under the influence of 
electromagnetic fields. Atomic nuclei spin in essentially the 
same manner as an electron. Magnetic resonance imaging 
(MRI) takes advantage of this. With an intense magnetic 
field, an MRI machine aligns the spins of the protons in hy- 
drogen atoms in the patient’s body. The machine then sends 
in a radio wave that tips the axes of some of the protons’ 
spins. As the spins come back into alignment, they emit some 
additional radio waves. These emitted radio waves can be 
thought of as echoes of the ones that the MRI machine sent 
in. With a lot of sophistication and experience, physicists and 
doctors have learned how to “listen” to those echoes and 
figure out what they tell about the tissue that produced them. 

Photons spin too, but not in just any direction. The axis 
of their rotation has to be aligned with the direction of their 
motion. This restriction cuts to the heart of modern particle 
physics, and it’s a consequence of a new kind of symmetry, 
called a gauge symmetry. The word “gauge” refers to a sys- 
tem of measurement or a measurement device. For example, 
a tire pressure gauge is a device for measuring tire pressure, 
and the gauge of a shotgun is a way of stating the diameter 
of the barrel. In physics, when an object can be described in 
several different ways, and there’s no a priori reason to prefer 
one over another, a gauge is a specific choice of which de- 
scription to use. Gauge symmetry refers to the equivalence of 
different gauges. Gauges and gauge symmetries are pretty ab- 
stract notions, so let’s consider a commonplace analogy before 
going further. I remarked earlier that it’s hard to tell whether 
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a turntable is turning or not, because it’s symmetrical. A con- 
venient way to remedy this is to mark the edge of the turn- 
table with a dot of whiteout. It doesn’t matter where on the 
edge you put the whiteout: For instance, you could put it on 
the side nearest you, or you could reach across and mark the 
other side. Wherever the mark is, its motion lets you tell at 
a glance that the turntable is turning. The choice of where 
to put the mark is like a choice of gauge. The arbitrariness of 
where you decide to put the mark is like gauge symmetry. 
Gauge symmetry has two important consequences for the 
quantum mechanical description of photons. First, it ensures 
that the photon is massless, so that it always travels at the 
speed of light. Second, it restricts the axis of the spin to be al- 
ways aligned with the direction of motion. It’s hard for me to 
explain how these two restrictions arise from gauge symme- 
try without delving into the mathematics of quantum field 
theory. But what I can do is to explain the relation between 
them. Consider first an electron, which has both mass and 
spin. If the electron is at rest, it wouldn’t make sense to say 
that its spin must be aligned with its motion, simply because 
it isn't moving. A photon, on the other hand, must always 
move at the speed of light. You can't move without moving 
in some direction. Thus it at least makes sense to restrict the 
axis of a photon's spin to be aligned with the direction of its 
motion. In short, the first restriction (masslessness) is neces- 
sary in order for the second (spin alignment) to make sense. 
The consequences of gauge symmetry make it seem like a 
very different idea from the symmetries we discussed previ- 
ously. It's more like a set of rules. A photon cant stand still, 
because of gauge symmetry. It can’t spin in certain directions, 
because of gauge symmetry. There's one more important 
thing to know: electrons have charge because of gauge sym- 
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Photons as waves and as particles. In the particle description, the axis of 
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the spin is aligned with the motion. In the wave description, the electric 
field has a corkscrew shape. If all the photons spin the same way, as I 
drew, the light is described as "circularly polarized." 


metry. An analogy between gauge symmetry and the rota- 
tional symmetry of a turntable helps illustrate this last point. 
Gauge symmetry for an electron is like rotational symmetry: 
one even speaks of “gauge rotations." But a gauge rotation 
isn't a rotation in space. It's more abstract, and it relates to the 
way one describes electrons quantum mechanically. Just as a 
turntable rotates at a constant rate (when it's turned on), an 
electron “rotates” in a more quantum mechanical sense, re- 
lated to gauge symmetry. This rotation is its electric charge. 
Electric charge is negative for electrons and positive for pro- 
tons. That means they "rotate" in opposite directions in the 
abstract sense that relates to gauge symmetry. 

It turns out that extra dimensions help make the whole 
discussion of charge more concrete. If there's one extra di- 
mension, and its shape is a circle, then you can imagine a 
situation where a particle is going around the circle. It can go 
around forward or backward. If the circle is really small, you 
would not notice it like the usual four dimensions. Neverthe- 
less, elementary particles could go around the circle, forward 
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or backward. If they went forward, they would have positive 
charge. If they went backward, they would have negative 
charge. The whole setup relies on a circular extra dimension, 
so it shouldn't surprise you to learn that the symmetries of 
a circle have a lot to do with gauge symmetry. In fact, the 
gauge symmetry of electric charge is the same as the symme- 
try ofa circle. That may seem like an abstract statement. But 
it has an immediate consequence. Motion in a circle is either 
forward or backward. There isn’t any other direction. In the 
same way, electric charge is either positive or negative. There 
isn’t any other kind of charge. 

The idea of explaining electric charge in terms ofa circular 
extra dimension predates string theory. It is a little less than 
a hundred years old. However, it’s never been made to work 
quantitatively. Part of the grand ambition of string theory is 
to make this idea come to fruition. We certainly have a bunch 
of extra dimensions to play with, so there should be some 
hope. Whether we're on the right track or not with extra di- 
mensions, the idea of gauge symmetry is here to stay. Electric 
charge and its interactions are fundamentally connected to 
the symmetries of a circle and motions around a circle. 

It seems we have wandered far from D-branes. But not 
really. D-branes provide examples of everything we've dis- 
cussed. We've already seen how D-branes have rotational 
symmetries: recall the comparison between a D1-brane and a 
flagpole, whose rotational symmetry is the same as the sym- 
metry of a circle. Rotational symmetries help explain the 
properties of D-branes. But gauge symmetry plays a big role 
too. Here's the first hint of the connection between gauge 
symmetry and D-branes. If you start with a D1-brane that 
1s stretched out straight, and you tap it in a particular place, 
two little ripples will move out from where you tapped it. 
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These ripples will move at the speed of light. They are like 
massless particles. Nothing will make them stand still. I’ve 
explained that massless particles like photons are associated 
with a gauge symmetry, and that their property of being 
massless is enforced by the gauge symmetry. That’s essen- 
tially what's going on with the ripples on a D1-brane. I’m 
oversimplifying, because these ripples aren’t quite like pho- 
tons. They have no spin. But, if we were to discuss ripples on 
a D3-brane, then some of them do have spin, and they have 
exactly the same mathematical description as photons. Pretty 
much as soon as D3-branes were invented, people started 
trying to build a model of the world in which the dimen- 
sions we experience are the ones on a D3-brane. There are 
still extra dimensions, but we can't get at them because we're 
stuck on the brane. What seems to give this idea a chance is 
that D3-branes come equipped with photons. All we need is 
the other fifteen or so fundamental particles, and we'd be set. 
Sadly, a D3-brane by itself doesn't provide them. It's an ac- 
tive area of research to find out what other ingredients you'd 
need in order to build the world on a D3-brane. 

D-branes in superstring theory also have charge, simi- 
lar to electric charge. The analogy is quite precise in the 
case of DO-branes. They have a charge that we could say 
is +1. There is another object, an anti-D0-brane, that car- 
ries charge —1. Now, remember our discussion of the almost 
hundred-year-old idea that charge is associated with a cir- 
cular extra dimension? It works perfectly for DO-branes. One 
of the breakthroughs of the second superstring revolution was 
that superstring theory was concealing an extra dimension, 
beyond the ten that we were used to. A DO-brane, which 
you'll remember looks like a point, can be described as a 
particle moving around that eleventh dimension, which is 
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rolled up in a circle. Ifa particle moves the other way around 
the eleventh dimension, it's an anti-DO-brane. This realiza- 
tion is what made people suddenly take eleven- dimensional 
supergravity seriously. In some sense, string theorists were 
studying it all along without realizing it! And it turns out 
that the eleventh dimension doesn't have to be curled up 
in a small circle. As you make the circle bigger and bigger, 
interactions between superstrings get stronger and stronger. 
They split and join so rapidly that it seems hopeless to try to 
keep track of them. But as the dynamics of the string pic- 
ture becomes more complicated, a new dimension literally 
opens up. Eleven-dimensional supergravity becomes the 
simplest description of the strongly interacting superstrings. 
We don't know exactly how to merge quantum mechan- 
ics with eleven-dimensional supergravity. But we feel con- 
vinced that there must be some way to do it, because string 
theory is a fully quantum mechanical theory, and it clearly 
includes eleven-dimensional supergravity when the super- 
string interactions become strong. It is this circle of ideas 
that soon got the name of M-theory. 

A great hope of string theorists is that all our notions of 
charge and gauge symmetry might simply stem from the se- 
cret higher-dimensional nature of the world. In chapter 7 I'll 
discuss more fully how this might work. In chapters 6 and 8, 
Pll explain how extra dimensions might be used to describe 
the strong interactions, like the interactions among quarks 
and gluons inside the proton. To give you a brief preview: 
In some circumstances, or in some approximation, these in- 
teractions might be effectively described in terms of a fifth 
dimension. This fifth dimension "opens up;" like the elev- 
enth dimension of M-theory, when interactions become too 
strong to keep track of in the usual four dimensions. 
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D-brane annihilation 


As I explained in the previous section, DO-branes carry a 
charge, and there is another object, called an anti-D0-brane, 
that carries the opposite charge. What would happen if a DO- 
brane collided with an anti-D0-brane? The answer is that 
they would annihilate each other, disappearing in an explo- 
sive burst of radiation. This section is devoted to describing 
in more detail how DO-branes and anti-D0-branes interact. 
To get started, let’s go back to the discussion in chapter 
4 of strings stretched between DO-branes. The purpose of 
that discussion was to tell you about the three contributions 
to the mass of a string. There was the rest mass, which came 
from stretching the string between the branes. There were 
the vibrations, which were like the motion of a piano string 
when struck. And there was the contribution of quantum 
fluctuations, which was negative, and very hard to get rid 
of. And very problematic, because it led to tachyons—things 
with imaginary mass. I mentioned that one way to get rid 
of tachyons was to move the DO-branes far enough apart 
so that the stretching energy was bigger than the negative 
contribution from quantum fluctuations. Well, let’s turn this 
around. What if we start with DO-branes far apart, and then 
bring them closer and closer together? The answer depends 
on details. To get the story straight, we have to distinguish 
carefully between DO-branes and anti-DO-branes. The only 
difference between them is their charge. Consider first the 
case of two DO-branes approaching one another. They have 
the same charge. That means that they repel one another the 
way electrons do. But they also have mass, so they exert a 
gravitational attraction on one another. The overall attrac- 
tion precisely cancels out the repulsion. The upshot is that 
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they scarcely notice one another. And it turns out that su- 
perstrings stretched between two DO-branes never turn into 
tachyons. This is a small example of the miraculous solution 
of the tachyon problem in superstring theory. 

Everything changes when you consider a DO-brane close 
to an anti-D0-brane. The D0-brane and the anti-D0-brane 
have opposite charges. So they attract one another, like an 
electron and a proton. The gravitational attraction is un- 
changed, just because DO-branes and anti-DO-branes have 
the same mass, and gravity responds to mass. The upshot is 
that there's a strong attraction between a DO-brane and an 
anti-D0-brane. Strings stretched between them know about 
this attraction. The way they know about it is that they be- 
come tachyons when the DO-brane and anti-D0-brane get 
too close. I noted in the previous chapter that the modern 
understanding of a tachyon is that something is unstable. The 
example I gave was a pencil standing on its point. Eventually, 
it falls over. A DO-brane sitting right on top of an anti-D0- 
brane is similarly unstable. What happens, as I remarked at 
the beginning of this section, is that they mutually annihi- 
late. The annihilation process is analogous to the pencil fall- 
ing over. You can get an alternative view of it by thinking of 
the eleventh dimension in the shape of a circle. A DO-brane 
is a particle going around the circle. An anti-D0-brane is a 
particle going the other way. If the DO and anti-D0 are right 
on top of one another, the particles will collide. When they 
do, the D-branes disappear in a flash of radiation. The details 
of this process should teach us something about M-theory, 
but unfortunately it's not very well understood. The trouble 
1s that the annihilation process is very rapid, and it's hard to 
track the way in which a large amount of energy gets released 
in a short time. What we can be sure of, based on E = mc’, 
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Left: A DO-brane and an anti-D0-brane come together and annihilate 
into strings. A string stretched between them becomes tachyonic when 
the branes get too close. Tachyonic means unstable. The tachyon is a 
quantum of instability. Right: When the DO is far from the anti-D0, 
the would-be tachyon is actually stable. When the DO and anti-DO 
come too close, the tachyon rolls away. This rolling is equivalent to the 
annihilation of the DO and anti-DO0. 
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is that the energy released is twice the rest energy of the D0- 
brane, plus any kinetic energy that the DO and anti-D0 may 
have had before the annihilation. 


Branes and black holes 


I introduced D-branes as locations in spacetime where 
strings are allowed to end. It turns out that there is another 
way to think about them: they are zero-temperature black 
holes. This way of thinking is best when there are many D- 
branes on top of one another. Let's start with DO-branes. As 
I just explained in the previous section, in superstring theory, 
two DO-branes don't exert any net force on one another. 
Their gravitational attraction is canceled by their electro- 
static repulsion, and they don't annihilate each other like a 
DO0-brane and an anti-D0-brane do. So we can consider two 
DO-branes on top of one another, or indeed any number, 
without worrying about violent processes like annihilation. 
However, the more DO-branes you have, the more the nearby 
spacetime is distorted around them. The distortion takes the 
form of a black hole horizon. To make this seem more plau- 
sible, think of a million DO-branes sitting on top of one an- 
other, and one lonely DO-brane moving nearby. The lonely 
DO-brane is neither attracted nor repelled. Actually, there's 
a caveat to that statement. The lonely DO-brane doesn't feel 
any net force at all i£ it isn't moving. If it is moving, then it 
does feel just a little tug toward the other branes. Similar 
tugging helps prevent the million DO-branes from dispers- 
ing. Everything is quite different for an anti-DO-brane. It 
feels both gravitational and electrostatic attraction, just like I 
described before. When it gets really close to the big clump 
of a million DO-branes, it's like one of those fish in the lake 
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who ventured too close to the drain. It gets sucked in. No 
physical process can save it when it gets closer than a certain 
distance. That’s basically the notion of a black hole horizon. 
What about the claim that the horizon has zero tempera- 
ture? This is harder to explain. It has to do with the behavior 
of the lonely DO-brane, which feels no net force from the 
clump. It turns out that this no-force condition is deeply re- 
lated to zero temperature. Both properties are enforced by 
supersymmetry. I defer a careful discussion of supersymmetry 
to chapter 7, but let's add incrementally here to our familiar- 
ity with supersymmetry with the following two statements. 
First: Supersymmetry relates gravitons and photons. Gravi- 
tons control gravitational attraction. Photons control elec- 
trostatic attraction or repulsion. The particular relation that 
supersymmetry implies between gravitons and photons just 
says that gravitational and electrostatic forces are equal. Sec- 
ond: Supersymmetry guarantees that DO-branes are stable. 
What this means is that there's no lighter object in string the- 
ory that a DO-brane can turn into—unless it encounters an 
anti-DO-brane. So a DO-brane, though heavy, is quite unlike 
a uranium-235 nucleus, which can decay into lighter nuclei 
such as krypton and barium, as I discussed in chapter 1. 
Clumps of DO-branes are also stable. They can't decay 
into anything else. The only thing they can do when they are 
together is vibrate a little. These vibrations are like the ther- 
mal vibrations of atoms in a lump of coal. If you remember, 
thermal vibrations can be translated into energy according to 
the formula E = k,T. Here E is the extra energy due to the 
thermal vibrations. For example, when you apply this for- 
mula to a carbon atom in a lump of anthracite coal, E is the 
extra energy of the atom due to its thermal vibrations, not 
its rest energy. The total energy of the lump of coal should 
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Top left: A clump of DO0-branes with thermal energy. Top right: 

A horizon forms around the DO-branes to describe their thermal 
properties. Bottom left: Three D3-branes on top of one another. Strings 
running between the branes act like gluons and can provide thermal 
energy. Bottom right: A horizon forms around the D3-branes to describe 
their thermal properties. 


include the rest energy of all the atoms and also the energy of 
their thermal vibrations. The atoms also have some quantum 
fluctuations in their positions, and in principle that also gets 
included in the total energy of the coal. It's all pretty similar 
to our earlier discussion of the three contributions to the 
mass of a string. The total mass of the lump of coal can be 
figured from its total energy using E = mč. 

Now, all this discussion of coal can be carried over to the 
clump of DO-branes. They've got rest mass, and they've got 
some quantum fluctuations. In the case of DO-branes, the 
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quantum fluctuations make exactly zero contribution to the 
total mass. (It's always a headache to track those quantum 
fluctuations!) The DO-branes might have some thermal fluc- 
tuations too. If they do, then the clump of DO-branes has a 
temperature, and it has extra mass. But it doesn't have extra 
charge. Now, if that lonely DO-brane happens to be close to 
a clump of DO-branes at some non-zero temperature, then 
the extra mass is going to add a little extra oomph to the 
gravitational pull on the lonely DO. So it will be drawn in. If 
you cool off the clump of D0-branes to absolute zero, it loses 
that extra bit of mass. So it no longer exerts any force on the 
lonely DO-brane. This is how zero temperature is associated 
with a no-force condition. 

If all this talk of DO-branes is losing you, let's take a break 
and talk about coal some more. Its thermal vibrations get in- 
cluded in its total energy, just as for the clump of DO-branes. 
This total energy is still the energy of the coal at rest. "At 
rest" just means that the coal is sitting there, as opposed to 
flying through the air. Total rest energy gets translated into 
total mass via E = mc’. So the lump of coal is actually heavier 
when it's hot than when it’s cold, just as a clump of D0- 
branes is heavier when it's hotter. In the case of a lump of 
coal, you can plug in some everyday numbers and figure out 
just how much mass the coal gains from being hot. Here's 
how I would do it. A really hot coal is about 2000 Kelvin. 
If you remember, the surface of the sun is only about three 
times hotter. E = k,T' is an estimate ofthe thermal energy in 
each atom of coal—but only an estimate. Using this estimate 
without trying to improve on it, I calculated that the thermal 
energy in a hot coal is about 10 " times its rest mass. That's 
one part in a hundred billion. This is much more than the 
fraction of rest mass that an Olympic sprinter can turn into 
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kinetic energy in the hundred-meter dash. But it is much, 
much less than the fraction of rest mass converted to energy 
in nuclear fission. That’s basically why nuclear power is so 
promising: a ton of fuel-grade uranium, used in a modern 
nuclear reactor, yields about the same amount of electrical 
energy as a hundred thousand tons of coal. 

I doubt you'll be happy to hear it, but the discussion of D0- 
branes was oversimplified in two ways. First, there’s another 
interaction among DO-branes, mediated by a massless particle 
that is neither a photon nor a graviton. This particle is called a 
dilaton, and it has no spin. Everything I said about gravitational 
attractions should really have been extended to include the di- 
laton. But even with that little change, the final conclusions are 
the same. Second, if DO-branes are behind a horizon, it’s hard 
to say whether they’re vibrating like atoms do. All you can say 
for sure is that the clump of DO-branes has some extra energy, 
which is the same as extra mass. A big problem in string theory 
is to give a more precise description of black holes made out 
of vibrating D-branes. The best-understood case involves D1- 
branes and D5-branes. Another important case is D3-branes. 
The DO-brane case is harder to work out quantitatively, but 
there has nevertheless been significant progress. 

When we switch from discussing the black hole view of 
D0-branes to D1-branes or D3-branes, the main thing that 
changes is the shape of the horizon. D3-branes surrounded 
by a black hole horizon are hard to visualize because the D3- 
branes extend over three spatial dimensions. You have to vi- 
sualize at least one more dimension to get a proper idea of 
what the horizon looks like. In later chapters, I’m going to 
work on explaining this case some more, because it’s really the 
most interesting. For now, let’s consider D1-branes in the four 


spacetime dimensions of everyday experience—assuming, as 
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we did before, that we somehow did away with the other six. 
When a single D1-brane is stretched out straight, it looks like 
a flagpole, and its fluctuations are the ripples I described ear- 
lier Gee p. 82). When many D1-branes come together, there 
are more types of ripples. The best way to think of these rip- 
ples is in terms of strings. A string can have one of its ends on 
one D1-brane, and its other end on another. It can slide along 
the D1-branes in the direction they’ve been stretched out. A 
string with ends is generally called an open string. That’s in 
contrast to a closed string, which by definition is a closed 
loop. Adding thermal energy to D1-branes basically means 
adding open strings. Surprisingly, the open strings describe 
all possible small vibrations of the D1-branes. In other words, 
the strings are essentially the ripples on the D1-branes. 

If there are a large number of D1-branes, then the whole 
assembly of D1-branes and open strings on them distorts the 
nearby spacetime, and a black hole horizon forms. The ho- 
rizon has the symmetries of a circle, just as a single stretched 
D1-brane does. You can think of the horizon as a cylinder 
that surrounds the assembly of D1-branes and open strings. 
That's a different shape from the horizon around a clump 
of DO-branes, which is spherical. Some string theorists pre- 
fer to use the term "black brane" to describe a group of 
D1-branes surrounded by a horizon. They reserve the term 
"black hole" for a spherical horizon, like the one surround- 
ing DO-branes. I favor a slightly looser usage: black branes, 
black holes, whatever rolls off the tongue more easily. For 
example, I would describe the cylindrical horizon sur- 
rounding a group of D1-branes as a black hole horizon, and 
I would refer to the whole geometry as a black D1-brane. 

Historically, it's interesting to note that the black hole 
(or black brane) geometries describing clumps of D-branes 
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were known before D-branes themselves were properly un- 
derstood. To understand black branes, all you need is to be 
able to solve the equations of supergravity. If you recall, su- 
pergravity is a low-energy limit of superstring theory, where 
you forget about all the overtones of string vibration and 
focus in on vibrational modes that are massless. Supergrav- 
ity is still pretty complicated. But it’s a lot simpler than su- 
perstring theory. The construction of black branes is one 
of several ways in which supergravity helped guide the de- 
velopment of string theory during the second superstring 
revolution. 


Branes in M-theory and the edges of the world 


So far, I have focused my discussion of branes on D-branes. 
I did this because D-branes are the most important, best un- 
derstood, and most diverse collection of branes. But it would 
be a distortion to leave out the other branes. Partly this is 
because they are weirder than D-branes. There’s probably 
more left to be discovered about them. Weirdest of all are 
the branes of M-theory. 

M-theory, I remind you, is the quantum mechanical the- 
ory that includes eleven-dimensional supergravity as its low- 
energy limit. Although M-theory is more than ten years old 
at the time of writing this book, the statement I’ve just made 
is still the most important thing we know about it. I would 
not hesitate to say that this is a disappointment. Still, there’s 
a lot to eleven-dimensional supergravity. In particular, it in- 
cludes two black branes: the M2-brane and the M5-brane. 
They are similar to the black branes in string theory that 
describe groups of D-branes surrounded by a horizon. They 
are especially similar to the black D3-brane. 
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M2-branes are extended in two spatial directions, and 
M5-branes in five. Like D-branes, they can be stretched out 
straight in the eleven dimensions of M-theory, or they can 
wrap around and close on themselves. Unfortunately, we 
don’t understand very well how M-branes fluctuate. We can 
track the motions of a single M2-brane that’s stretched out 
and nearly flat. Its motions are like the ripples on D1-branes 
that I described in the previous section. We can similarly 
track the motions of a single M5-brane. But when multiple 
M-branes sit on top of one another, the story becomes more 
complicated, and it has defied understanding for many years. 
Literally as I write this chapter, this wall of ignorance seems 
to be cracking. A handful of papers have appeared that pur- 
port to describe the dynamics of two or more M2-branes 
on top of one another. But we are still far, far away from the 
level of detailed understanding we have in the case of strings. 
We can keep track of how a string moves, both classically and 
quantum mechanically, whether the string is nearly straight 
or flopping all over the place. There are still some conceptual 
roadblocks to a similar understanding of M2-branes. And 
M5-branes are, if anything, even more mysterious. 

There’s one further type of brane in M-theory that is 
really surprising. This brane is the edge of spacetime. It’s 
where space itself ends. Usually, in string theory, space can’t 
end, any more than a string can end without a D-brane 
around. The space-ending brane is one of the wilder ideas 
in M-theory, but it’s actually very well accepted. It turns 
out that there are photons at the edge of spacetime, much 
like the photons on D-branes. But the photons at the edge 
of spacetime participate in a particularly interesting theory 
called supersymmetric Es gauge theory. A lot of work in the 
middle 1980s, after the first superstring revolution, revolved 
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around folding this theory up in such a way as to recover the 
theories of electromagnetic and nuclear forces. It turns out 
that all this work has an M-theory interpretation in terms of 
spacetimes that end on a space-ending brane. 

The space-ending brane is one of the ways in which M- 
theory has moved decisively beyond eleven-dimensional 
supergravity. This advance required some use of quantum 
mechanics. Another such advance was the calculation of the 
mass of M2-branes and M5-branes. Actually, the mass of 
an M2-brane is infinite when it is stretched out straight and 
flat across an infinite area. The same goes for M5-branes. 
What’s understood, again from quantum mechanics, is that 
the mass per unit area of an M2-brane is a fixed number. 
This is actually more information than we have about string 
theory itself: the mass per unit length of a string is arbitrary 
as far as we know. 

Besides D-branes and M-branes of various stripes, there’s 
one more brane in superstring theory. Actually, it was the first 
one to be understood. It’s a 5-brane, like the M5-brane, but 
it lives in ten dimensions, not eleven. It’s sometimes called the 
solitonic 5-brane, and for lack of a more descriptive name, I'll 
stick with that one. Solitons are a widespread notion in physics, 
and in general they are heavy, stable objects. The classical ex- 
ample is a wave that can travel along a channel, like a canal, 
without dissipating or breaking. “Soliton” evokes the word 
“solitary.” It’s supposed to communicate the idea that a soliton 
has its own identity. Nowadays we understand that D-branes 
have their own identity too. All branes can be loosely de- 
scribed as solitons of string theory. But here I'll use "solitonic" 
to describe only the 5-brane I was just starting to talk about. 

The solitonic 5-brane is worth mentioning for two rea- 
sons. First, when we get to discuss string dualities, it’s useful 
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to know that the solitonic 5-brane exists, because duality 
symmetries relate it to other branes. Second, our under- 
standing of the solitonic 5-brane is an example of the idea 
that spacetime has no meaning by itself, but exists only to 
describe how strings move. I tried to illustrate this idea in 
chapter 4 using an analogy between strings in spacetime and 
cars on a racetrack. The first salient feature of a racetrack 
that I suggested could be deduced from a record of how the 
racecars move is that it’s a closed loop. Well, the central idea 
of the solitonic 5-brane is similar. You start out by assuming 
that superstrings move on the surface of a sphere. Actually, 
for technical reasons, the sphere you use has one dimen- 
sion more than the one that approximates the shape of the 
Earth’s surface. This higher-dimensional sphere is called a 
3-sphere. What I want to convey is that it’s like the racetrack 
in my analogy: closed, finite, and of definite size. Now, if 
you recall, superstrings are quite picky about what kind of 
geometries they will tolerate. They insist on ten dimensions, 
and they insist that the equations of general relativity should 
be obeyed. Having started with a 3-sphere, you need to add 
time plus six spatial dimensions. The overall shape you wind 
up with is quite distinctive. Here’s how it looks. Spacetime 
far from the solitonic 5-brane is flat and ten-dimensional. As 
you move inward, you find a deep hole in spacetime with a 
definite size: the size of the 3-sphere you started out with. 
This “deep hole” is related to a black hole, just like every 
other brane in string theory. But it turns out you can go as 
deep into a solitonic 5-brane as you like without crossing 
a horizon. What that means is that no matter how deep 
you go into a solitonic 5-brane, you can always turn around 
and come back. Physics deep down in the hole eventually 
gets pretty strange: strings start interacting strongly, and in 
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some cases an extra dimension opens up, bringing us back 
to eleven dimensions. 


I hope this chapter leaves you with two overall impressions. 
First, strings are not the whole story—far from it. Second, 
the whole story is complicated and detailed. At least, it seems 
complicated and detailed. Often, when things get so compli- 
cated and detailed, a deeper level of understanding eventu- 
ally simplifies the story. A good example is chemistry, where 
there are about a hundred different chemical elements. The 
unifying understanding came from the realization that all of 
them are made up of protons, neutrons, and electrons. There 
is a similar profusion of elementary particles in the conven- 
tional understanding of high-energy particle physics. There 
are photons, gravitons, electrons, quarks (six kinds!), gluons, 
neutrinos, and a few others. String theory aims to be a uni- 
fying picture, where each of these particles is a different vi- 
brational mode of a string. At some level it’s disappointing to 
learn that superstring theory has its own proliferation of dif- 
ferent objects. On the positive side, this proliferation forms 
an extraordinarily tightly woven web, where every type of 
brane can be related to every other one and to strings. These 
relations are the topic of the next chapter. 

It’s hard to keep from wondering whether there’s some- 
thing deeper and simpler than branes—maybe some kind of 
“sub-brane” from which all branes are composed. I don’t see 
any hint of "sub-branes" in the mathematics of string theory. 
But there are certainly hints in plenty that our understand- 
ing of that mathematics is incomplete. The third superstring 
revolution, if it ever comes, has a lot of problems to solve. 
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STRING DUALITIES 


A DUALITY IS A STATEMENT THAT TWO APPARENTLY DIFFER- 
ent things are equivalent. I already discussed one example 
in the introduction: a checkerboard. You can think of it as 
black squares on a red background, or as red squares on a 
black background. Those are “dual” descriptions of the same 
thing. Here’s another example: dancing the waltz. Probably 
you've seen this in old movies, or maybe you've even done 
it. The man and the woman face one another, close together. 
"There's a particular way you hold your arms, but never mind 
about that. What matters most is the footwork. When the 
man steps forward on his left foot, the woman steps back on 
her right foot. When the man steps forward on right, the 
woman steps back on left. As the man turns, the woman 
turns to stay facing him. If you leave out fancy moves like 
spins, you could work out exactly what the woman should be 
doing based on what the man does—and vice versa. There's 
the old joke that Ginger Rogers did everything that Fred 
Astaire did, but backwards and in heels. That's kind of like a 
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string duality. Every object in one description can be equally 
well captured in another. 

When you watch Fred and Ginger dance in an old movie, 
part of the charm of the dance is how they mirror one an- 
other. Similarly, in string theory, when you understand a du- 
ality, you get a more insightful and informative picture than if 
you only understood one side of the duality. Limiting yourself 
to only one side of the duality would be like watching only 
Fred, or only Ginger. Captivating, maybe, but incomplete. 

Here's a real example of a string duality. We've talked about 
strings, and we've talked about D1-branes. Both of them ex- 
tend in one dimension of space. As in the previous chapter, 
I mostly want to focus on ten-dimensional superstring the- 
ory, as opposed to the 26-dimensional string theory that has 
tachyon instabilities. A famous string duality, called S-duality, 
interchanges superstrings with D1-branes. That's interesting, 
but it’s only one aspect of the duality—as if all I told you about 
the waltz is that the woman steps back on her right foot when 
the man steps forward on his left. In order to give a fuller ac- 
count, I have to tell you what S-duality does to every brane in 
superstring theory. Before I do that, I have to introduce one 
additional complication. There are different kinds of super- 
string theory. They can be distinguished by which kinds of 
branes are allowed. The type of superstring theory I want to 
talk about is called Type IIB. This name is not very descrip- 
tive. It was chosen before a lot of the unique dynamics of this 
particular string theory were understood. But I'll stick with it. 
Type IIB string theory has D1-branes, D3-branes, D5-branes, 
solitonic 5-branes, and a few other branes that are more com- 
plicated to explain. It doesn't have DO-branes, or D2-branes, 
or any other even-numbered brane. It's a string theory, not 
M-theory, so it doesn't have M2-branes or M5-branes. 
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Back to S-duality. I introduced it by saying that strings 
are exchanged with D1-branes. It turns out that D5-branes 
are exchanged with solitonic 5-branes, and D3-branes are 
unaffected by the duality. What this means is that if you start 
with a string on one side of S-duality, you end up with a D1- 
brane on the other side; but if you start with a D3-brane on 
one side, you end up with a D3-brane on the other. There’s 
more to the story, but already we can put together some of 
the statements I've made to learn something new. A string 
can end on a D5-brane. (This is because a D5-brane, like any 
D-brane, is defined as a location where strings can end.) How 
does S-duality affect this statement? S-duality instructs us to 
replace “D5-brane” with “solitonic 5-brane" and "string" by 
“D1-brane.” So the new statement is that a D1-brane can end 
on a solitonic 5-brane. This new statement can be indepen- 
dently checked, and it's true. String dualities got built up in 
roughly this way: certain rules of translation were proposed, 
then new consequences were deduced and checked. 

In general, a string duality is a duality relation between 
two apparently different string theories, or between two ap- 
parently different constructions in string theory. A whole 
web of string dualities is now known. It is so well connected 
that you can start with any brane you like, go through a few 
dualities and “deformations,” and wind up with any other 
brane. lll explain as I continue just what I mean by a de- 
formation. Before we start, it's worth returning to an earlier 
point about unifying pictures that we discussed near the end 
of chapter 5. There are so many different branes in string 
theory! One might expect eventually to find a unifying pic- 
ture where all branes are different manifestations ofthe same 
underlying structure. Dualities aren't like that. They trade 
one type of brane for another. Sometimes they trade branes 
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for strings. At our current level of understanding, it appears 
that all types of strings and branes are at some level coequal. 
This is qualitatively more than chemists understood about the 
different elements of the periodic table before atomic theory. 
But it is qualitatively less than physicists understood about 
chemical elements after atomic theory was well established. 

The story of string dualities got going just as I was a begin- 
ning graduate student. I remember viewing them with some 
skepticism. Was this really what I wanted to study? It certainly 
was a pretty subject, but it seemed pretty distant from the goal 
of making string theory into a theory of everything. My take 
on the subject now is that it was an inevitable progression in 
our understanding. Some of the best prospects for connecting 
string theory to experiment rest squarely on dualities. 

Our understanding of string dualities varies. S-duality is 
actually one of the more mysterious kinds of dualities. The 
rule for mapping strings to D1-branes is well understood and 
well checked in the case where the strings or the D1-branes are 
stretched out, straight, and (nearly) motionless. But the rules of 
S-duality are not so well understood for strings or D1-branes 
that are flopping around and colliding with one another in 
arbitrary ways. The difficulty relates to the strength of string 
interactions. I’ve described the splitting of a string into two 
strings as similar to a pipe branching into two smaller pipes. 
The surface of the pipe is like the worldsheet of the string, 
which is the surface in spacetime that the string sweeps out 
over time. Strings joining would be like two pipes coming 
together into one larger pipe. The strength of string interac- 
tions is a way of quantifying how frequent those splitting and 
joining events are. When string interactions are weak, a string 
travels a long way before splitting or interacting with another 
string. When string interactions are strong, there are so many 
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splittings and joinings that you can scarcely keep track of a 
single string: no sooner would you identify it than it would 
split, or join another string. When strings interact strongly, 
D1-branes interact weakly, and vice versa. So S-duality inter- 
changes weakly interacting and strongly interacting behavior. 

In case all of this got out of hand for you, let’s return to 
the dancing analogy. Weakly interacting behavior in string 
theory is clean, simple, and elegant. It's like Fred Astaire’s 
dancing. Strongly interacting behavior is chaotic and messy. 
Strings fly all over the place, but they’re hardly strings any- 
more because they're splitting and joining so fast. The only 
analogy I can think of is a slimy alien. So S-duality is like 
Fred Astaire dancing with a slimy alien— sorry, Fred. But 
this alien is actually just as good a dancer as Fred, in its 
own way. We just can't easily appreciate what it's doing. If 
we were aliens ourselves, the opposite would be true. We'd 
perceive the alien's dance as clean, simple, and elegant, and 
because of our altered perspective, it would be Fred that 
looked like a slimy mess. The point that I’m aiming at in this 
analogy is that string dualities often relate something that we 
understand well (like weakly interacting string theory) with 
something that we don’t (like strongly interacting behavior). 

You may recall that when I discussed strongly interacting 
string theory in the previous chapter, the upshot was that a 
new dimension opened up. I claimed that string theory starts 
behaving like it is actually eleven-dimensional, not ten- 
dimensional. That's pretty different from what I explained 
in the last few paragraphs. In fact, I had in mind a different 
string theory. The one that grows an extra dimension when 
the string interactions become strong is called Type IIA 
string theory. It has DO-branes, D2-branes, D4-branes, D6- 
branes, solitonic 5-branes, and some other objects that are 
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a bit harder to explain. When the string coupling is strong, 
Type IIA string theory is best described in terms of eleven 
dimensions. But Type IIB string theory at strong coupling 
is best described by swapping D1-branes for strings, without 
doing anything funny with extra dimensions. 

I've emphasized already that there's a lot we don't under- 
stand about string dualities. So it's worth closing this section 
by summarizing the two things that we do understand reli- 
ably for every single string duality. The first is the low-energy 
theory. In every string theory we know, gravity is always part 
of the story. The description of gravity in general relativity 
is very special and very durable. It has a limited set of gen- 
eralizations, which are the supergravity theories that I men- 
tioned in the previous chapter. Supergravity theories capture 
the low-energy dynamics of superstrings because they include 
only the lowest energy vibrational modes of the superstrings. 
Our understanding of gravity and supergravity is so complete 
that, collectively, they become one of the main touchstones 
for our understanding of string dualities. The second touch- 
stone is long, straight strings and long, straight branes. These 
are the objects that can be described as zero-temperature 
black holes in supergravity. They also have special no-force 
properties, such as those I described in my discussion of D0- 
branes. A minimum specification of a string duality amounts 
to describing what happens to the low-energy theory, plus 
what happens to these long, straight branes. 


A dimension here, a dimension there, who's counting? 


In this section, I want to discuss the best-understood string 
duality. It's called T-duality. These names—S-duality and 
T-duality—are as arbitrary as Type IIA and IIB. String 
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theorists face a peculiar difficulty in naming things: we ex- 
plore at the edge of knowledge and have to have names for 
things. So we make them up as we go. Often, the names we 
pick are arbitrary, or they refer to some very early work on 
a topic. But the names tend to stick even if the relevance of 
the early work fades. So we wind up with a hodge-podge 
of funny names. I think other fields of science have similar 
difficulties, but maybe not to the same degree. 

Anyway, T-duality is the string duality that relates Type 
IIA and IIB string theory. It is so well understood because 
the whole story makes sense when strings interact only 
weakly. That means that strings travel a long way, or last a 
long time, before splitting or joining. 

There is obviously a big problem in relating Type HA 
and IIB string theory. IA string theory has even-numbered 
D-branes: DO, D2, D4, D6. IIB has odd-numbered D-branes: 
D1, D3, D5. How can you possibly map a DO-brane onto a 
D1-brane? Especially if the D1-brane is long and straight, it 
seems impossible. Well, here’s the trick. You roll up one of 
the ten dimensions of type IIA string theory on a circle. If 
that circle is much smaller than the length scales you can 
observe, then it looks like string theory has only nine di- 
mensions. You could keep rolling up more dimensions until 
you get down to four. But let's not do that. We're trying to 
explain the relations among string theories here, not (at least, 
not yet) their possible relation to the world. So let's stick with 
just one rolled up dimension. In our new nine-dimensional 
world, the claim is that you can't tell the difference between 
Type IIA and Type IIB string theory. Take a type HA D0- 
brane, for example. If you wrapped a D1-brane all the way 
around the circle, it would look like a DO-brane to an ob- 
server whose observational powers aren't precise enough to 
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see the size of the rolled up dimension. All I mean is that to 
such an observer, the wrapped D1-brane wouldn’t seem like 
it had any spatial extent at all. It would seem like a point 
particle: a O-brane. But wait! Isn't it possible for the D1-brane 
not to be wrapped, but instead to extend in one of the nine 
dimensions that our hypothetical hyperopic observer can see 
clearly? Well, yes, it is possible. On the other hand, it's also 
possible to wrap a D2-brane around that circular dimension. 
Then its shape is like a long hose. The cross-section ofa hose 
is circular: that's the circular dimension that we rolled up. 
Just as a hose can snake across your lawn in a more or less ar- 
bitrary way, so a wrapped D2-brane can wander across nine 
dimensions. To the nine-dimensional observer we've been 
discussing, it looks just like a D1-brane. That's because this 
observer can't see closely enough to tell that the D2-brane is 
wrapped around on the extra dimension. The story contin- 
ues about as you'd imagine: wrapped D3-branes act like D2- 
branes, wrapped D4-branes act like D3-branes, and so on. 
The discussion so far might leave you with the impression 
that T-duality is only an approximate truth. Type IIA and IIB 
string theory look the same to a nine-dimensional observer 
only if she’s not allowed to look so closely that she can discern 
the tenth dimension rolled up as a circle. Actually, T-duality 
is an exact duality. When you look at it using just the right 
mathematical language, it's almost as simple as the checker- 
board duality between red and black squares. Although that 
mathematical language isn't really available to us, I can tell 
you the main point: a Type IIA string wrapped around the 
circular dimension is the same thing as a Type IIB string that 
isn't wrapped, but that instead is moving around the circle. 
Conversely, a Type IIA string that is moving around the circle 
is the same thing as a Type IIB string that is wrapped around it. 
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9-dimensional view 


T-duality of Type IIA and Type IIB string theory. Both are related to a 
nine-dimensional theory. A 0-brane in nine dimensions can originate 
from a DO-brane in Type IIA theory, or equivalently from a D1-brane 
wrapping the circle in Type IIB theory. 


The tricky bit is that the circle that the Type IIA string can 
wrap or move around has a different size than the one that 
the Type IIB string can move around or wrap. To under- 
stand this, we have to remember a little bit about quantum 
mechanics. When an electron moves inside an atom, it has 
definite, quantized energies, but its position and momentum 
are uncertain. A string that moves quantum mechanically 
around a circle is similar: it too has definite, quantized ener- 
gies, but uncertain position. It turns out that the momentum 
of the string is quantized, like the energy. This is interesting, 
because it means that the uncertainty principle doesn't apply 
in its usual form to motions on a circular dimension. Instead, 
the mathematics that leads to the uncertainty principle tells 
us that when the circle is very small, the momentum of a 
moving string must be very big. As a consequence, its energy 
1s very big too. Conversely, if the circle is very big, then the 
energy of a moving string can be very small. Let's compare 
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this situation with the energy of a string wrapping around a 
circle. The mass of a wrapping string is proportional to its 
length: that is, if you double the length, you double the mass. 
This is one way in which a string in string theory behaves 
just like ordinary string: it has a fixed mass per unit length. It 
follows that a string wrapping once around a big circle must 
be very heavy, while a string wrapping once around a small 
circle is light. Now we can come to the punch line. If you're 
going to replace a Type IIA string that is moving on a circle 
with a Type IIB string that wraps around a circle, you'd bet- 
ter do it in such a way that the energies match. If the circle 
that the Type IIA string theory moves on is small, then the 
energy is big, and that means that the circle the Type IIB 
string wraps had better be big. Likewise, if the IIA circle 
is big, the IIB circle must be small. If you squeeze the IIA 
circle smaller and smaller, the IIB circle gets so big that you 
can scarcely tell that it's a circle at all. We could describe the 
situation by saying that the IIB circle opens up into a nearly 
flat spatial dimension. This might remind you a little of the 
duality between Type IIA string theory and M-theory. In 
that duality, an eleventh dimension opens up when you make 
string interactions very strong. 

I promised to explain the term "deformation," which I 
used in the previous section in connection with string duali- 
ties. Changing the size ofa circle is one example ofa deforma- 
tion. Changing the strength of string interactions is another. 
In general, what I mean by a deformation is any change that 
takes place smoothly. A string duality is not a deformation. 
Instead, it’s a relation between two theories, each of which 
can be deformed. Or you could think of a string duality as 
just a change of perspective: you describe the same physics in 
two different ways. Sometimes one is much simpler than the 
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other: for example, Type IIB string theory is much simpler 
when interactions are weak than when they are strong. And 
yet S-duality exchanges weak and strong interactions. This 
relation of simplicity to complexity is what my analogy to 
Fred Astaire and the slimy alien was supposed to capture. 
What this analogy doesn’t capture so easily is that you can 
smoothly change the strength of string interactions, from 
weak to strong or strong to weak. It’s as if we could gradually 
deform Fred Astaire into the alien, while at the same time the 
alien turns gradually into Fred. One of the central insights 
of the second superstring revolution was that by deforming 
a theory in various ways, and passing through the various 
known dualities, one can get from any string theory to any 
other. I’ve introduced you to three: T-duality, relating Type 
IIA string theory to Type IIB, S-duality, relating Type IIB 
to itself, and the duality that relates Type IIA string theory to 
M-theory. There are three other superstring theories, and 
dualities that relate them, but I don't think it would help to 
discuss them here. 

On a first pass, I expect it is hard to keep track of all 
the different branes and dualities. But I hope that one point 
came across clearly: spatial dimensions in string theory are 
mutable. They come, they go, they shrink and grow. It's not 
clear to me that the eventual relation of string theory to the 
world has to involve extra dimensions per se. If spacetime 
is only an approximate notion when dimensions are small, 
maybe the right description of the world involves four big 
dimensions—the ones we know and love—and then some 
more abstract mathematical qualities that stand in for extra 
dimensions. There are constructions like this dating back to 
the first superstring revolution, but they're not very popular 
these days. 
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Gravity and gauge theory 


One particular string duality has become a field unto itself: 
the gauge/string duality. It is unusual in that it relates type 
IIB string theory not to another string theory, but to a gauge 
theory. I discussed gauge symmetry in chapter 5 at some 
length. Let me recap the essential points. Gauge symme- 
try guarantees that photons are massless. It guarantees that 
the axis of the spin of a photon is aligned with the direc- 
tion of motion of a photon. And it allows us to view elec- 
tric charge as rotation in an abstract space associated with 
the gauge symmetry. A gauge theory is any theory whose 
mathematical description includes a gauge symmetry. Usu- 
ally that means that the theory includes photons, or things 
like photons. The theory of light (which 1s also the theory of 
electric and magnetic fields) is a simple gauge theory. More 
complicated gauge theories are of interest not just to string 
theorists, but also to particle physicists, nuclear physicists, 
and condensed matter physicists. 

You may remember that the gauge symmetry of photons 
and electrons is secretly the same as the symmetries of a cir- 
cle. A charged object, like an electron, effectively has some 
rotation around this circle. We don’t have to take this circle 
as literally as we do the eleventh dimension of M-theory. 
It’s only there in the mathematics to tell us about electric 
charges and their interactions with photons. One aspect of 
this mathematics is that photons themselves do not carry 
electric charge: they only respond to it. 

It is natural to ask: If the symmetries of a circle are associ- 
ated with photons, is there a gauge theory associated with the 
symmetries of a sphere? It turns out there is such a theory. It 
has three different kinds of photons, corresponding to the 
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three ways of rotating a sphere. (In aviation, those three types 
of rotations are called pitch, roll, and yaw.) What makes them 
really different from ordinary photons is that they're charged. 
You may recall that we had an extended discussion about 
the cloud of virtual particles that surround an electron or 
a graviton. Again I will recap the main points. There is a 
clear distinction between gravity, where gravitons proliferate 
by responding to one another, and electromagnetism, where 
photons can only proliferate by splitting into electrons, which 
produces further photons, and so on. The latter case is by far 
the more tractable. You can keep track of the whole cas- 
cade of virtual particles. Photons and electrons are therefore 
said to form a renormalizable theory. This theory is called 
quantum electrodynamics, or QED for short. Gravity, on the 
other hand, is non-renormalizable. This means that the vir- 
tual gravitons cascade out of any sort of mathematical control 
we know how to impose. Now, what about the gauge theory 
relating to the symmetries ofthe sphere? It turns out it’s more 
like QED than like gravity. It's renormalizable. 

A cornerstone of our understanding of the physics inside 
a proton is a gauge theory called quantum chromodynamics, 
or QCD for short. It is based on a symmetry group with eight 
different kinds of rotations. As usual, these rotations don't act 
in our usual four dimensions: they act in some more abstract 
mathematical space called “color space.” QCD is quite simi- 
lar to the gauge theory based on the symmetries of a sphere. 
It's just a bit more complicated because there are eight kinds 
of rotations instead of the pitch, roll, and yaw rotations of a 
sphere. Each of these eight rotations corresponds to a particle, 
similar to the photon. Collectively, these eight particles are 
called gluons. There are also particles like electrons, called 
quarks. But while electrons can only have negative charge, 
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quarks can have one of three different types of charge. This 
charge is called color, and color space is the mathematical tool 
for keeping track of it. A quark’s charge can be red, green, 
or blue. This is only a manner of speaking: there isn’t really 
any relation to the color we see with our eyes. Just as photons 
respond to the charge of an electron, so gluons respond to the 
charge of a quark. But gluons are also colored. They respond 
to each other the way gravitons do. Unlike the uncontrol- 
lable cascade of virtual gravitons from other gravitons, the 
cascade of virtual particles from a quark is something you 
can keep track of mathematically. So QCD is renormaliz- 
able, like QED. The name was chosen partly because QCD 
greatly resembles QED, and also because “chromodynamics” 
means “the dynamics of color.” Again, this is a notion of 
color divorced from what you see with your eyes. Color is 
just a way of visualizing a mathematical abstraction. 

Quarks, gluons, and color-that-isn’t-color make QCD 
sound almost as fanciful as string theory. But unlike string 
theory, it is experimentally very well tested. It is universally 
accepted as the correct description of physics inside the pro- 
ton. It has many odd features, the most notable being that 
you can never directly measure a quark. This is because it 
dresses itself with gluons and other quarks to such an extent 
that you never see anything but bound states of quarks and 
gluons. Protons are such bound states. So are neutrons. Elec- 
trons are not. They seem to have nothing to do with quarks. 
More properly, they are on an equal footing with quarks: 
separate and equal. One of the big unverified ideas of mod- 
ern particle physics is that electric charge might be secretly 
a fourth type of color. I'll discuss related ideas in chapter 7. 

The fluctuations of D3-branes are described by gauge the- 
ories similar to QCD. I’ve already discussed fluctuations of 
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D3-branes 


Three D3-branes very close to one another are labeled “red,” “green,” 
and “blue.” Strings running from one brane to another describe the 
fluctuations of the branes. 


D1-branes. I had two pictures for them: you can think either 
in terms of a ripple or wave traveling along the D1-brane, 
or you can think of strings attached to the D1-brane and 
sliding along it. The latter description generalizes better to 
D3-branes. Suppose we put three D3-branes on top of one 
another. Just for the sake of illustration, let's distinguish them 
by labeling one red, one blue, and one green. Ifa string runs 
from the red brane to the blue brane, then intuitively, it's 
colored. It's purple, right? Well, no—it turns out that's tak- 
ing the metaphor of color too far. The proper way to describe 
the string's color is simply that it runs from red to blue. And 
it turns out that this is exactly the kind of color that gluons 
possess. You can now almost understand why there are eight 
types of gluons. There's red-to-red, red-to-blue, red-to- 
green; three types starting with blue; and three more starting 
with green. That's nine total. Oops, one too many! Unfor- 
tunately, l'd have to introduce an unreasonable amount of 
extra mathematics in order to explain why one is extra. 

Up to this little problem of the extra gluon, we've seen 
how gluons come out of strings on a trio of D3-branes. 
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Quarks are trickier. In the interests of getting to a punch 
line, I'll leave them out. Clearly, it was just a choice to have 
three D3-branes together. I could have had just one. Then I 
would have just photons, like in electromagnetism. I could 
have had two, and then I'd get the theory I mentioned ear- 
lier where the gauge group was the symmetry of a sphere. 
Or I could have some large number N, in which case there 
are lots and lots of gluons: about N*. 

The next step is to remember that when many branes come 
together, the best description of them is in terms of a zero- 
temperature black hole. I explained this in chapter 5 in the 
case of DO-branes. The story is similar for D3-branes. With 
a lot of them on top of one another, they distort spacetime in 
their vicinity in such a way that there is a black hole horizon. 
The horizon surrounds the D3-branes in a way that’s hard to 
visualize, because there are too many dimensions. The shape 
of the horizon is like a cylinder. It’s round in some direc- 
tions and straight in others. But a cylinder is round in one 
direction and straight in one. It has no additional dimensions 
beyond those two. The horizon surrounding D3-branes is 
round in five directions and extended in three. So in total 
it’s eight-dimensional. Tough stuff! And pretty distant from 
QCD, or so it seems. If there is extra vibrational energy on 
the D3-branes, then the horizon grows a little and acquires a 
finite temperature. 

An important part of the gauge/string duality is the re- 
alization that you can apply formulas like E=k,T to the 
vibrations on D3-branes and gain an understanding of the 
temperature of the horizon surrounding the D3-branes. Let 
me try to explain why this is now regarded as a string du- 
ality. There are two ways to describe D3-branes at finite 
temperature. One is to keep track of all the open strings 
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sliding around on the D3-branes. Another is to keep track of 
the horizon surrounding the D3-branes. The two views are 
complementary in the following sense. If there is a horizon, 
then you can’t say for sure what’s inside it. In other words, 
the existence of a horizon prevents you from keeping track 
of the strings on the D3-branes. At least, you can’t keep 
track of them one by one. What you can do is to track ag- 
gregate quantities such as their total energy. It turns out that 
when there’s a horizon, the gluons are strongly interacting. 
They split and join frequently. They flicker in and out of 
existence. They dress themselves with complicated cascades 
of other gluons. As in strongly interacting string theory, 
they’re scarcely identifiable as gluons. The emergence of the 
horizon is somewhat like the growth of the extra dimension 
of M-theory. It explains strong coupling dynamics of gluons 
in a language that requires extra dimensions. 

There’s much more to the gauge/string duality than keep- 
ing track of the energy of thermal gluons. The right way to 
understand it is that the curved black hole geometry near 
the D3-branes is precisely equivalent to the gauge theory 
of gluons on the D3-branes. This is a strange statement be- 
cause the curved geometry is ten-dimensional, whereas the 
gluons only know about four dimensions. It is also strange 
because it relates a theory with gravity (string theory in the 
vicinity of the D3-branes) to a theory with no gravity (the 
gauge theory on the D3-branes). It seems at first more fo- 
cused, more narrow than other string dualities. T-duality, for 
example, relates the whole of Type IIB string theory to Type 
IIA string theory. It includes rules for mapping every type of 
D-brane to every other type. The gauge/string duality seems 
restricted to the dynamics of just one type of brane: the D3- 
brane. But in fact, other branes enter into the gauge/string 
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duality in interesting ways, for example to allow for quarks 
as well as gluons. I'll have more to say about the gauge/string 
duality in chapter 8, where I’ll describe some attempts to 
connect it to the physics of heavy ion collisions. 


As a closing thought for this chapter, let me point out that 
string dualities are different from symmetries, even though 
both express a notion of sameness. The two things related 
by a string duality can experience different numbers of di- 
mensions. As we just saw, one can include gravity while the 
other doesn’t. That seems really different from a symmetri- 
cal object like a square. All its corners are the same, and the 
symmetries of a square explain precisely how self-similar a 
square is. On the other hand, there are some string dualities 
where the two sides seem more nearly mirror images. For 
example, Type IIA and IIB string theory are really very sim- 
ilar, despite having different types of branes. String dualities 
show up in low-energy supergravity in a way that’s closely 
connected to ordinary symmetries like the symmetry of a 
square. It’s possible that our understanding of string dualities 
is incomplete, and that a more unified view of them would 
make the analogy with ordinary symmetries more precise. 
There are hints of such a unified view, but too much of what 
we do understand is restricted to the low-energy theories. 
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SUPERSYMMETRY 
AND THE LHC 


IN THE SUMMER OF 2008, WHEN THE CONSTRUCTION OF THE 
Large Hadron Collider, or LHC, was almost complete, I 
visited the site and took a tour of one of the main LHC 
experiments. Mostly I was there for a conference, but the 
tour was really fun. The experiment I visited, called the 
Compact Muon Solenoid, is about the size of a three-story 
building. I saw it in the last stages of being put together. A 
massive cone-shaped endcap was being fitted into the main 
barrel-shaped body of the detector. Its design is a little like 
a digital camera, but every part of it looks inward toward its 
center, where high-energy collisions of proton beams occur. 

When the conference finished, I took the opportunity 
to do a bit of mountaineering in the French Alps. Nothing 
hard—just a little alpine climbing. The last thing I did was 
to climb up a ridge to the Aiguille du Midi, from which 
my climbing partner and I caught a téléphérique that took 
us back down to the town below. The ridge we climbed is 
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famously narrow, heavily trafficked, and snow-covered. For 
some reason everyone seems to climb it roped up. I’ve never 
quite approved of the practice of climbing roped when no 
one is tied to a solid anchor. If one person falls, it’s hard for 
the others to avoid being pulled off their feet. Usually I think 
it’s better to trust yourself and climb unroped, or else anchor 
and belay. But I'll admit that I climbed the ridge roped up 
to my climbing partner like everyone else. My partner was a 
very solid climber, and the ridge isn’t really that tough. 

In retrospect, I think that roped teams climbing a narrow 
ridge provide a good analogy to the Higgs boson, which is 
one of the things LHC experimentalists hope to discover. 
Think of it this way. Standing on top of the ridge, you're 
balanced precariously. Both sides are quite steep, so if you 
fall off either way, you're a goner. Tachyons in string theory 
are like that: they’re balanced in an unstable way, and the 
slightest perturbation sends them sliding down a slope to a 
fate that string theorists are only starting to understand. But 
there’s more. Let’s say you have eight people roped up, and 
the first one falls off to the left. The second one is probably 
going to get pulled off to the left too. The third hasn’t got a 
prayer of holding the weight of two falling climbers, so he’s 
going to go too. The really right thing to do in this circum- 
stance is to jump off the other side of the ridge and trust the 
rope. But for some reason this is hard to do. 

Back to tachyons and the Higgs boson: The point I want 
to make is that tachyons usually indicate an instability at 
every point in space, and that those instabilities are "tied 
together" like roped-up climbers. If a tachyon starts rolling 
in one direction at one point in space, it tends to drag the 
tachyons nearby along with it. 
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The Higgs boson describes what happens after the tachy- 
ons finish “condensing.” (Tachyon condensation is the tech- 
nical term for the roll-off from the ridge.) Let’s imagine a 
merciful outcome for the unlucky climbing team that falls 
off a high ridge: they slide down to the bottom of a valley 
and come gently to a halt. Let’s suppose they’re so tired out 
that they can’t make it back up the slope. Instead they wan- 
der around near the bottom, occasionally making small for- 
ays up the slope and then sliding back down. This is roughly 
what the Higgs boson is like. Once the tachyons condense 
everywhere in spacetime, the quantum fluctuations around 
their resting point are Higgs bosons. 

A problem with the analogy between roped climbers and 
the Higgs boson is that the direction in which the Higgs 
boson moves is not one of the familiar three directions of 
space. Instead, it’s like an extra dimension of spacetime— 
but mathematically tamer. It’s also crucial to realize that the 
Higgs boson is hypothetical. It might simply not be there. 

Despite the hypothetical status of the Higgs boson, there’s 
a lovely, deep theory based upon it that has reigned supreme 
for decades as the best empirical description of particle phys- 
ics. It is called the Standard Model. “Standard” reminds us 
that it is widely accepted. “Model” evokes the fact that it 
is still provisional, and almost certainly incomplete. There’s 
much more to the Standard Model than just the condensa- 
tion of a tachyon. Among other things, it says that the Higgs 
boson controls the mass of subatomic particles such as elec- 
trons and quarks. It’s been hoped for years that an accelerator 
called the Tevatron, near Chicago, would find the Higgs. 
And there’s still some hope that it might. But the LHC should 
find either the Higgs boson or else some other weird stuff 
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that stands in its place. An earlier facility in Texas, the Super- 
conducting Super Collider, had an even greater chance to 
make dramatic new discoveries. Construction began in 1991. 
Then, in 1993, Congress pulled the plug. In doing so they 
probably saved the American taxpayer ten billion dollars. I 
think it was a bad choice. It certainly means that America 
has ceded its dominance in experimental particle physics to 
Europe for the foreseeable future. Fortunately, European na- 
tions stayed the course in building the LHC. And Americans 
have contributed significantly to the LHC effort. So we still 
have a shot at big, important discoveries. 


The weird math of supersymmetry 


A great hope for the LHC is that it could discover super- 
symmetry. This is the symmetry that keeps superstring the- 
ory balanced. It does so by excising tachyons, as I described 
briefly in chapter 4. It is also the symmetry that relates gravi- 
tons and photons and guarantees the stability of DO-branes, 
as I discussed in chapter 5. Supersymmetry and string theory 
are logically distinct. But they’re deeply intertwined. Dis- 
covering supersymmetry would mean that string theory is 
on the right track. There might still be skeptics who would 
point out that you can have supersymmetry without string 
theory. While this is true at some level, I think the existence 
of supersymmetry without string theory would be too great 
a coincidence to be believed. 

But just what is supersymmetry? I’ve danced around 
this question a few times already in this book. Let me now 
try to plow straight into it. Supersymmetry calls for extra 
dimensions of a very peculiar sort. The dimensions we're 
used to, and also the extra dimensions of string theory that 
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I've discussed so far, are measured in length. And length is 
a number: 2 inches, 10 kilometers, and so forth. You can 
add two lengths to get another length, and you can mul- 
tiply two lengths to get an area. The extra dimensions of 
supersymmetry are not measured in numbers. At least, not 
ordinary numbers. They are described by anti-commuting 
numbers, which are the cornerstone of the weird math of 
supersymmetry. Anti-commuting numbers also play a role 
in describing electrons, quarks, and neutrinos, which are 
collectively called fermions. Even though I haven’t defined 
“anti-commuting” or “fermion” yet, I’m going to use these 
words in the interests of calling things by their true names, 
or as close as I can come to their true names without using 
too much math. The extra dimensions of supersymmetry 
are called fermionic dimensions. 

Here’s what these funny fermionic dimensions feel like. 
You can choose to move into them or not, just as you can 
choose to move forward or sideways. But if you move in a 
fermionic dimension, there’s only one “speed” at which you 
can move. Speed itself is only a rough analogy to what it 
means to move in a fermionic dimension. What’s closer— 
though still incomplete—is spin. Moving in a fermionic di- 
mension means that you're spinning more than if you didn’t 
move. The spin of a top can be bigger or smaller accord- 
ing to how hard you twist it before letting it go. But fun- 
damental particles can only have certain amounts of spin. 
The Higgs boson (if it exists) has no spin. An electron has a 
minimal amount of spin. A photon has twice as much; but 
as we learned earlier, the axis of its spin has to align with 
its motion. A graviton has twice as much spin as a photon. 
And that’s it. No fundamental particle can spin more than 
a graviton. If supersymmetry is right, a Higgs boson isn’t 
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moving at all in the fermionic dimensions. An electron is 
moving in only one. A photon is moving in two. The story 
gets a little more vexing for gravitons: depending on how 
many fermionic dimensions there are, it might be that part 
of a graviton’s spin is due to its motion in the fermionic 
dimensions, and part of it is intrinsic to the ordinary dimen- 
sions of spacetime. 

To summarize, there’s a sort of exclusiveness about these 
fermionic dimensions. Either you experience them (like an 
electron does) or you don’t (like a Higgs boson). This ex- 
clusiveness has another manifestation, called the exclusion 
principle. It says that no two fermions can occupy the same 
quantum state. Electrons are fermions, and there are two of 
them in a helium atom. These two electrons can’t be in the 
same state. They have to vibrate differently around the he- 
lium nucleus, or they have to have different spins—or both. 
The definition of a fermion is something that obeys the ex- 
clusion principle. 

Bosons are all the other particles: photons, gravitons, glu- 
ons, and the Higgs boson—if it exists. Bosons are very dif- 
ferent from fermions. Not only are they allowed to be in the 
same state as other bosons; they prefer it. Supersymmetry 1s 
a relation between bosons and fermions. For every boson, 
there is a fermion, and vice versa. For example, if the Higgs 
boson exists and supersymmetry is correct, then there is a 
Higgs fermion, sometimes called the Higgsino, or occasion- 
ally the shiggs. Whatever you call it, the Higgsino would 
basically be a Higgs boson that’s moving in one of the fer- 
mionic dimensions. 

Fermionic dimensions are hard to draw. The way they’re 
usually studied is through some strange rules of algebra. Let’s 
say there are two fermionic dimensions. You have a letter for 
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each one: let’s use a and b. You can add and multiply them, 
and most of the usual rules of algebra apply. For example: 


ata=2a 
2(a + b) = 2a + 2b 
atb=b+a. 


But there are some very strange rules for multiplying fer- 
mionic quantities together: 


axb=-bxa 
axa=0 
bx b=0. 


The way to think about this is that 1 means you are mov- 
ing only in bosonic dimensions; a means you're moving in 
the first fermionic dimension; and b means you're moving in 
the second fermionic dimension. If you try to move twice as 
much in the first fermionic dimension, you might try to de- 
scribe yourself as a x a. The identity a x a = 0 says that the 
motion you just tried isn’t allowed. The meaning ofa x b = 
—b x ais harder to explain. To see why it is naturally part 
of the algebra of fermionic quantities, let me rephrase the 
rules of multiplication in the following way: q x q = 0 for 
any combination q of fermionic quantities. If q = a, you get 
axa=0.Ifq=b6, you get b x b = 0. But what do you get 
ifq=a-+ b? Let's multiply it out: 


(a+b) x a@t+b)=axataxb+bxatb™xb. 


Pll bet you used to do this kind of manipulation in high 
school math classes. My teachers called it a FOIL expansion. 
The first term on the right side of the equation is the first 
term from the left-hand factor times the first term in the 
right-hand factor. First-first is abbreviated “F.” The second 
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term is the product of the outer terms from the left-hand side: 
that is, a from the first factor and b from the second. Outer is 
abbreviated “O.” The third term is the product of the inner 
terms: b from the first factor and a from the second. Inner 
gives us the "I" in FOIL. The fourth term is the product of 
the last terms from each factor, so “L” for last-last. Now for 
the punch line. We've assumed that q X q = 0 for any fermi- 
onic quantity q, whether it’s a, or b, or a + b. If we use this as- 
sumption, then the FOIL expansion I just worked through is 


O=axb+bxa. 


That’s the same as a x b = —b x a, which is what I wanted 
to explain. A key idea to take away from this discussion is 
that fermionic dimensions require some funny algebra. You 
might even say that fermionic dimensions are nothing more 
than the algebraic rules that describe them. 

Supersymmetry is symmetry under rotations between bo- 
sonic dimensions and fermionic dimensions. What does this 
mean, exactly? Well, symmetry is a notion of sameness, like 
how a square looks the same if rotated by 90°. A bosonic 
dimension is one of the usual ones, like length or width. 
(The six extra dimensions of string theory are bosonic di- 
mensions too, but that doesn’t matter just now.) Fermionic 
dimensions amount to the funny rules of algebra I explained 
in the previous paragraph. A rotation between a bosonic di- 
mension and a fermionic dimension means that if a particle 
was moving in a bosonic dimension before the rotation, then 
afterwards it isn’t; and if it wasn’t before, then afterwards it is. 
Physically, if you start with a boson and rotate it into a fer- 
mionic dimension, then it becomes a fermion. If you think 
of this rotation in terms of mathematics, you would replace 
the number 1 (representing the bosonic dimension) by a or b 
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(representing a fermionic dimension). Where the notion of 
sameness comes in is that the fermion you end up with has 
the same mass and charge as the boson you started with. This 
brings us to one of the most distinctive predictions of super- 
symmetry: it implies that for every boson, there’s a fermion 
with the same mass and charge, and vice versa. 

One thing we know for sure is that the world isn’t perfectly 
supersymmetric. If there were a boson with the same mass 
and charge as an electron, we'd surely know about it. For one 
thing, it would totally alter the structure of the atom. What 
might be happening is that supersymmetry is “broken,” or 
violated, by some mechanism similar to tachyon condensa- 
tion. If the idea of a strange new symmetry that isn't really 
a symmetry starts making you feel like there's smoke and 
mirrors somewhere, I don't blame you. Like much of string 
theory, supersymmetry is a long chain of reasoning without 
firm contact with experimental physics. 

If the weird ideas of supersymmetry and fermionic di- 
mensions are borne out by discoveries at the LHC, it will be 
a triumph of pure reason beyond anything that's happened 
in our lifetimes. A lot of people really have their hopes set on 
it. But wishing won't make it so. Supersymmetry is there, in 
some approximate form, or it isn't. Frankly, lll be surprised 
either way. 


The theory of everything—maybe 


Here is a synopsis of the canonical ideas about how string 
theory describes the real world. String theory starts out 
with ten dimensions. Of course, I’m talking about super- 
string theory here, so there are some additional fermionic 
dimensions; but let's put them aside for a moment. Six of 
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the ten dimensions get rolled up in some more or less com- 
plicated way. There is a preferred way to do it that exploits 
the mathematical structure of superstrings, taking advantage 
of both the supersymmetry and some other properties of 
the worldsheet description. The rolled up dimensions are 
small—perhaps a few times bigger than the typical size of a 
vibrating string. All the overtone modes are so massive that 
they play essentially no role in physics accessible at the LHC. 
The most important information comes from the lowest vi- 
brational modes of the strings. In some scenarios, there are 
D-branes, or other branes, peppered through the extra di- 
mensions, and they introduce additional quantum states for 
strings that can be relevant to LHC physics. 

After rolling up six of the ten dimensions of string theory, 
what you really want to know is what physics in the re- 
maining four dimensions is like. The answer is that there 
is always gravity, and there is usually also a gauge theory 
not unlike QCD. Gravity comes from a massless string state 
that is quantum mechanically smeared out over the extra six 
dimensions. The gauge theory can come either from similar 
smeared-out string states, or from extra string states associ- 
ated with branes. 

Gravity in four dimensions is great—that's what general 
relativity describes. So the question of whether string theory 
provides a "theory of everything" mostly comes down to 
whether the gauge theory you get from rolling up the extra 
dimensions leads to realistic predictions about subatomic 
particles. To understand a little more about that gauge the- 
ory, first remember that we described the gauge symmetry 
of QCD in terms of three colors: red, green, and blue. Well, 
the best-motivated candidates to describe everything— 
quarks, gluons, electrons, neutrinos, and all the rest—have 
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at least five colors. String theory constructions can accom- 
modate five-color gauge symmetry in several natural ways. 
We don’t see those five colors yet because something serves 
to distinguish two of them from the other three. That some- 
thing could be similar to a Higgs boson, but there are other 
ideas. To understand what makes five special, remember the 
enumeration of fermions: there are quarks, electrons, and 
neutrinos. Quarks come in three colors, but electrons and 
neutrinos each come in just one. Three plus one plus one is 
five. It’s really that simple. 

After the dust settles, the best string constructions yield 
low-energy physics that is strikingly like what we have al- 
ready seen in particle physics experiments. Typically, they re- 
quire supersymmetry and demand not just one Higgs boson, 
but two; and they require a whole host of other particles 
whose masses are comparable to the Higgs. They also ac- 
commodate a very small mass for the neutrino. And they 
incorporate gravity as described by general relativity. All in 
all, this is pretty impressive: certainly, no other theoretical 
framework for fundamental physics does as well in supplying 
the right ingredients with the right dynamics. If string theo- 
rists could somehow hit upon just the right construction, it 
would be the “theory of everything”: that is, it would include 
all the fundamental particles, all the interactions they experi- 
ence, and all the symmetries they obey. Nothing would be 
left but to solve the equations of this theory and predict every 
measurable quantity in particle physics, from the mass of the 
electron to the strength of interactions among gluons. 

There are, however, some persistent difficulties. A lot 
depends on the size and shape of the extra six dimensions. 
There is no reason that we know of why these dimen- 
sions couldn’t be flat. In other words, we don’t know of 
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any dynamics that would force us to live in four dimensions 
instead of ten. One possibility is that all dimensions were 
tightly curled up in the early universe, and that for some 
reason it is easier for just three of them to unroll into the 
spatial dimensions of our experience than for all nine to un- 
roll. But this still doesn’t explain in any detail why the extra 
dimensions have the shape they have. To make things worse, 
the extra dimensions tend to be floppy. To understand what 
I mean by that, let me remind you of our discussion of the 
clump of DO-branes. It too had a certain floppiness, in that 
each DO-brane was only barely restrained from flying away 
from the others, and D0O-branes outside the clump were nei- 
ther attracted nor repelled by it. The floppiness of extra di- 
mensions means that they could change their size or shape 
just as easily as a DO-brane could escape from the clump. 

A lot of effort has gone into finding ways to tie up these 
extra dimensions so that they dont flop around anymore. 
The typical ingredients are branes and magnetic fields. It's 
easy to understand the role of branes. They're like pack- 
ing cord you tie around a package. But suppose your pack- 
age was really mushy. You'd need lots of packing cord to 
keep the package from bulging out one way or another. The 
magnetic fields play a similar role in that they stabilize the 
extra dimensions in some way. 

The picture you wind up with is that the extra dimensions 
are complicated. There are probably many, many ways to tie 
them up so that they can't flop around. This myriad of pos- 
sibilities is sometimes regarded as a good thing, because of 
another problem, called the cosmological constant problem. 
Briefly, if there is a cosmological constant, then the three di- 
mensions of spacetime themselves have a tendency to bulge 
out over time. We see from astronomical observations that 
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The world according to string theory (maybe). The usual four 
dimensions (above) have a slight tendency to expand. The extra six 
dimensions (below) have to be tied up with wrapped branes and other 
tricks to keep them from bulging out or changing shape. 
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most galaxies are moving away from us, and this is inter- 
preted as an expansion of space itself. What a cosmological 
constant would do is to cause that expansion to accelerate. In 
fact, observations over the last ten years seem to indicate that 
the expansion of the universe is accelerating in a way that’s 
consistent (so far) with a very small cosmological constant. 
If we want to describe the world using string theory, then it 
seems we need to tie up the extra six dimensions so that they 
can't move at all, but leave the usual three with just the slight- 
est tendency to expand and to accelerate their expansion. It's 
hard to figure out exactly how to do that. But it does seem 
that the number of ways of tying up the extra dimensions is 
tremendously large. According to some string theorists, with 
so many possibilities available, there must be at least a few 
where everything works out just right, so that the cosmo- 
logical constant comes out in an acceptably small range. Our 
universe just happens to be one in which the extra dimen- 
sions are tied up in just the right way. If it weren't—the ar- 
gument goes—intelligent life would probably be impossible, 
so we wouldn't exist. Turning things around, our existence 
implies that the universe we inhabit has a small cosmological 
constant. Altogether, I find myself unconvinced that this line 
of argument is useful in string theory. 

String theorists have been hammering on the question 
of how to cook up the theory of everything for more than 
twenty years. Rolled up extra dimensions always play a role. 
'The more we learn about string theory, the more possibilities 
there seem to be. It's embarrassing. Perhaps it is worth com- 
paring the difficulty of getting fully realistic four-dimensional 
physics out of string theory to a long-standing problem in an- 
other corner of theoretical physics: high-temperature super- 
conductivity. Discovered starting in 1986, high-temperature 
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superconductors conduct large amounts of electricity without 
significant energy loss. High-temperature is perhaps an exag- 
geration: the temperatures in question are comparable to the 
freezing point of air. But that’s a lot hotter than previously 
known superconductors, and there are already some impor- 
tant industrial applications. Theoretically, though, it’s very 
hard to understand how high-temperature superconductivity 
works. There’s a theory from the 1950s that explains ordinary 
superconductors, and it’s based on tying pairs of electrons to- 
gether. The force that pairs them together is based on sound. 
The electrons sort of “listen” to one another, over distances 
many times the size of an atom, and then coordinate their 
motion so as to avoid energy loss. Magical. But also fragile. 
Too much thermal motion prevents this pairing from occur- 
ring: it’s as if the electrons can’t “hear” one another over the 
din of thermal noise. It’s believed that no amount of tinker- 
ing with this 1950s explanation, in which electrons coordi- 
nate their motions through sound waves, will account for the 
remarkable properties of high-temperature superconductors. 
Electrons probably still pair up in these materials, but over a 
much shorter distance, in a much stronger way. They seem to 
take advantage of fine-grained features of their environment 
to pair up. There are some compelling theoretical ideas about 
how this happens, but I do not think the problem is solved. 
Solved or not, high-temperature superconductivity might 
offer some lessons for string theory. The main one is that pure 
reason is often not enough. High-temperature superconduc- 
tors were an experimental discovery, and theory has been 
struggling ever since to catch up. The correct theory of the 
world could be quite different from what we are now capable 
of imagining. The fragile pairing of electrons through sound 
waves reminds me of the floppiness of extra dimensions: just 
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barely holding together. It could be that the way string the- 
ory really relates to the world is as different from these tied- 
up bundles of branes, magnetic fields, and extra dimensions 
as the modern explanations of superconductivity are from 
theories of the 1950s. And it might take at least as long to 
figure it all out. 


Particles, particles, particles 


In chapter 5, I alluded briefly to the long list of known ele- 
mentary particles: photons, gravitons, electrons, quarks (six 
kinds!), gluons, neutrinos, and a few others. Explaining the 
whole list wouldn’t add much to the obvious point that this is 
a lot of different particles, each with its own peculiar proper- 
ties and interactions. Long lists of objects cry out for a unify- 
ing theory with fewer elementary objects and a deeper level 
of explanatory power. Chemistry’s periodic table receives 
such a unifying treatment through atomic theory. Helium, 
argon, potassium, and copper are all as different as they ever 
were in chemical reactions. But atomic theory reveals that 
they are all composed of electrons in quantum states of vibra- 
tion around an atomic nucleus composed of protons and neu- 
trons. The long list of elementary particles might receive a 
unifying treatment in terms of string theory. As for the long 
list of objects in string theory—D-branes, solitonic 5-branes, 
M-branes, and so on—no one knows how or whether they 
might be unified beyond the level of string dualities. 

The most massive particle discovered to date is the top 
quark. Its mass is about 182 times the mass of a proton. It 
was discovered in 1995 by large experimental collaborations 
at the Tevatron, which is the premier particle accelerator 
in the United States. Protons and anti-protons are whirled 
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A proton-proton collision at the LHC might produce a Higgs boson 

in the manner indicated here. In the process I have drawn, the Higgs 
promptly decays into a bottom quark and an anti-bottom quark, which 
can be detected. But the “junk particles” can cause confusion as to 


what really happened. 


around a large ring (about 6.3km around) and smashed to- 
gether in head-on collisions. When they hit, each of them 
has an energy 1000 times as large as its rest mass. It’s not 
surprising that such collisions can produce a top quark: 
there’s plenty of energy available. In fact, there’s apparently 
enough energy to make a particle ten times as massive as 
the top quark: 1000 + 1000 = 2000 proton masses. Unfor- 
tunately, it’s all but impossible for all this energy to go into 
a single particle. This is because protons and anti-protons 
have structure. Each contains three quarks and also some 
gluons. When the proton and anti-proton collide, most of 
the quarks and gluons miss one another, or experience only 
glancing collisions. What’s interesting is the situation where 
one quark or gluon from the proton hits hard against one 


SUPERSYMMETRY AND THE LHC 


134 


from the anti-proton. Such a hard hit—more commonly de- 
scribed as a “hard process"—14s what creates top quarks at the 
Tevatron. Hard processes should also create Higgs bosons, 
if they exist. Because hard processes involve only one quark 
or gluon from the proton and one from the anti-proton, the 
energy available to make top quarks is only a fraction of the 
total energy of the collision. 

The LHC will collide pairs of protons with a total energy 
of about 15,000 times the mass of a proton. The amount of 
energy available in a hard process might be about a tenth of 
this—sometimes more, sometimes less. Speaking in round 
numbers, the LHC can be expected to produce particles 
copiously whose rest mass is up to 1000 times the mass of 
a proton. Heavier particles should also be produced, up to 
perhaps 2000 times the mass of a proton. 

But the heavier a particle is, the rarer it will be that a hard 
process has enough energy to produce it. 

Just what kinds of particles should we expect the LHC to 
discover? At time of writing, the honest answer is: We're not 
sure, but there had better be something. I do not mean this 
in the sense that the LHC will be a big waste of money if 
it doesn't discover anything—though that's obviously true. 
What I mean is that there's a good argument, independent 
of ideas of supersymmetry or string theory, that there is 
something lurking at the energy range that the LHC will 
explore. It could be just the Higgs boson. Most likely, it's 
the Higgs boson and some other particles. If we're lucky, it 
will be supersymmetry. The argument that something has to 
be there relies on renormalization. I gave a brief, qualitative 
account of renormalization in chapter 4, but to remind you, 
it is the mathematical machinery that allows us to keep track 
of the cloud of virtual particles that surround an electron, 
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or indeed any particle. This machinery only works if there's 
something like a Higgs boson in a range of energies that the 
LHC should explore. For it to work smoothly, there has to 
be something like supersymmetry in addition to the Higgs. 
But let's not forget that our mathematical machinery is not 
the world. We could just be wrong. There could be some- 
thing at the LHC that we haven't imagined. That would 
be the most exciting possibility of all. Or—despite all our 
well-reasoned expectations—there could be nothing to see. 

Let's get back to supersymmetry, which is a favored can- 
didate to describe LHC physics. As I explained earlier, a 
striking prediction of supersymmetry is that for every par- 
ticle that we know, there's a new one with the same mass 
and charge and essentially the same interactions, but dif- 
ferent spin. We know the electron. Supersymmetry pre- 
dicts a super-electron, or "selectron" for short. We know 
the photon. Supersymmetry predicts a super-photon, more 
commonly known as a “photino.” Likewise, supersymmetry 
predicts squarks, gluinos, sneutrinos, and gravitinos. Even 
the Higgs would get a super-partner, usually called the 
higgsino (but occasionally the shiggs). As I also explained 
earlier, supersymmetry can't be exactly right: for example, 
we know there isn't a selectron with the same mass as the 
electron. Approximate or "broken" supersymmetry still 
predicts that there are selectrons, photinos, sneutrinos, and 
all the rest. But their masses could be quite a bit larger than 
the particles we have discovered so far. It is reasonable to 
assume that most or all of these super-particles (and yes, we 
call them sparticles) have masses within reach ofthe LHC. If 
that's true, then the LHC could be the most prolific discov- 
ery machine in history, turning up not just a handful of new 
fundamental particles, but a dozen or more. 
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A symmetry that demands a collection of new particles 
equal in size to everything presently known might seem like 
a step backward rather than forward. After all, aren't we sup- 
posed to reach for unifying pictures with more explanatory 
power in terms of fewer ingredients? That's exactly how I 
felt about supersymmetry when I first learned about it. But 
here's a comparison worth pondering. The equation for an 
electron, discovered in the 1920s, led to a very unexpected 
prediction: the existence of an anti-electron, more com- 
monly called a positron. Soon, physicists were predicting an 
anti-particle for almost every particle they knew. And they 
found them! For me, supersymmetry doesn't have the same 
aura of inevitability. It’s not needed to describe particles we 
do know, in the way that the equation for the electron was 
needed. But perhaps it's not fair to compare foresight and 
hindsight. 

It's one thing fora particle to exist with a mass in the right 
range for the LHC to find it, and quite another to actually 
discover it. That's because it's complicated to sort through 
all the junk that comes out of a collision and reconstruct 
what happened. It's actually possible that the Tevatron has 
been producing Higgs bosons for years, but reconstructing 
them requires such subtlety that they have escaped notice. 
In fact, physicists generally favor a mass range for the Higgs 
boson of no more than 150 proton masses: lighter than the 
top quark! Sparticles might actually be easier to find at the 
LHC than the Higgs. Gluinos in particular should be pro- 
duced copiously, if they are in an accessible mass range. 
Equally important, they are predicted in many supersym- 
metric theories to go through a spectacular chain of decays 
that should be relatively easy to pick out of the data. In this 
chain of decays, the gluino sheds some of its rest energy by 
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turning into a different type of sparticle. Then that new 
sparticle sheds some of its rest energy in the same way. After 
several such steps, one is left with the lightest sparticle. The 
lightest sparticle is often abbreviated LSP. It’s usually as- 
sumed that the LSP won’t decay at all, but instead will es- 
cape undetected. If all this is right, then what detectors 
at the LHC will observe is not the superpartners, but the 
particles they shed during their decays to the LSP. 

Before telling you more about the LSP, I should men- 
tion one of the unfortunate facts about the LHC: even if 
it discovers things that look like sparticles, it will be tricky 
to say for sure whether they are unambiguous evidence of 
supersymmetry. This is basically because proton-proton col- 
lisions are messy. Lots of particles come out. Known inter- 
actions among quarks and gluons are so strong that they can 
mask new phenomena. And it’s hard to determine the spin 
of a newly discovered particle. For all these reasons, physi- 
cists have advocated building a companion machine to the 
LHC, called the International Linear Collider, or ILC. It 
would collide electrons and positrons. Such collisions pro- 
vide a much cleaner experimental environment. It would 
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be possible to distinguish more cleanly than at the LHC be- 
tween supersymmetry and alternative theories. But the ILC 
is still only a proposal. The dark fate of the Superconducting 
Super Collider shows how hard it is to bring such proposals 
to fruition. 

Let’s get back to supersymmetry. The LSP, if it exists, 
could be the most important discovery of all, because it could 
be the dark matter that draws galaxies together. For decades, 
cosmologists and astronomers have puzzled over the total 
mass of galaxies. They can count the stars in a galaxy (at least 
roughly). From that count, they can estimate how much or- 
dinary matter exists in a galaxy. By ordinary matter, I mostly 
mean protons and neutrons, because they are the main carri- 
ers of mass. The problem is that galaxies never seem to have 
enough mass in ordinary matter to hold together in the way 
that they do. Hence the hypothesis of “dark matter”: there's 
extra stuff that we don’t see in galaxies that was primarily re- 
sponsible for drawing them together in the first place. Based 
on a variety of measurements, many or most cosmologists 
believe that there is five or six times as much dark matter 
in the universe as there is ordinary matter. But what is it? 
Various proposals have been floated, ranging from burnt-out 
stars to subatomic particles. LSPs as dark matter have two 
main virtues. First, in many of the most realistic supersym- 
metric theories, they are very massive (more than 100 times 
the mass of a proton), electrically neutral, and stable—mean- 
ing that they never decay into other particles. Second, it is 
easy to understand how they could have been produced in 
the early universe in approximately the right abundance— 
meaning that they comprise five or six times as much total 
mass today as ordinary matter does. 
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Altogether, supersymmetry is a wonderful theoretical 
framework. It’s well motivated by strange mathematics. It’s 
beautifully consistent with established particle theory, in- 
cluding renormalization. And it predicts a lot of new particles 
that we can hope to see at the LHC. Finally, supersymmetry 
and string theory are so deeply intertwined that it is hard for 
me to believe one could have supersymmetry in the world 
unless string theory in some form is correct. Let me put it 
this way: Supersymmetry is a little like a string duality. It re- 
lates particles to sparticles, just as S-duality relates strings to 
D-branes. Like a string duality, it leaves you wanting more. 
Isn’t there some unifying picture underlying all the particles 
and sparticles? Shouldn’t supersymmetry itself be a hint to 
what that underlying picture should be? String theory pro- 
vides a clean answer, where supersymmetry is built in from 
the start, and where all the particles we know or will dis- 
cover have a more-or-less unified origin in terms of string 
dynamics and extra dimensions. 
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HEAVY IONS AND THE 
FIFTH DIMENSION 


A STRANGE FACT ABOUT THE RELATION BETWEEN SUPERSYM- 
metry and LHC physics is that the main ingredients were ap- 
proximately in place twenty years ago or more. There have 
certainly been advances in the past two decades, both theo- 
retical and experimental. The top quark was a big discovery, 
although it was long anticipated. The non-discovery of the 
Higgs constrains models of supersymmetry in interesting 
ways. Theoretical understanding of supersymmetry has 
deepened considerably, and the range of possible manifesta- 
tions of supersymmetry at the LHC has been explored much 
better than it was in the late 1980s. But these advances have 
been in some sense incremental. Especially now, with the 
LHC about to start generating data, one has the sense of 
the whole field holding its breath. But it's been holding its 
breath a little too long. Supersymmetry is so entrancing that 
it has survived literally decades of non-discovery without 
losing its place as the main hope. Alternative theories tend 
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to get calibrated against supersymmetry to such an extent 
that they start resembling supersymmetry. 

Recently, a wholly different route to connecting string 
theory to the real world has been developed. On the string 
theory side, it is based on the gauge/string duality, which I 
introduced in chapter 6. On the real world side, it relates to 
heavy ion collisions, which I'll describe more in the next 
section. In such collisions, temperature and density rise so 
high that protons and neutrons melt into a fluid called the 
quark-gluon plasma, or QGP for short. There are ways of 
understanding this melted fluid that have nothing to do 
with string theory. The right way to characterize the aim 
of the field is to make string theory one of several quanti- 
tatively useful tools for describing the quark-gluon plasma. 

This is clearly a less lofty aim than to produce a theory of 
everything and reveal the ultimate structure of the physical 
universe. But, at present, the putative connection of string 
theory to heavy ion physics has two charming features that 
are missing in the theory-of-everything side of string the- 
ory. First, the intellectual content on the string theory side 
is firmly rooted in string dynamics and the gauge/string 
duality. This is a more direct access to string theory it- 
self than most theory-of-everything scenarios are likely to 
offer. That's because connections between string theory and 
LHC physics are mostly mediated through supersymmetry 
and the low-energy limit of string theory, where all but the 
lightest string states drop out of the physics. Second, string 
theory calculations have already been compared to experi- 
mental data, with some degree of success. Caution is still 
very much in order, and there are significant critiques and 
disagreements about how and whether string theory relates 
to heavy ion collisions. Nevertheless, this field is producing 


HEAVY IONS AND THE FIFTH DIMENSION 


142 


the closest contact to date between modern string theory 
and experimental physics. 


The hottest stuff on Earth 


The Relativistic Heavy Ion Collider (RHIC) is a particle 
accelerator located on Long Island, not far from New York 
City. Its basic design is similar to the Tevatron and the 
LHC. But it is relatively puny: it can only accelerate sub- 
atomic particles to an energy of about 100 times their rest 
mass. The Tevatron reaches a factor of 1000, and the LHC 
will reach as high as a factor of 7000 for protons. The big 
difference between the Tevatron and RHIC is that RHIC 
accelerates gold nuclei. There are almost 200 nucleons in a 
gold nucleus. (Remember, a nucleon is a proton or a neu- 
tron.) Gold was chosen because it has a big nucleus, and for 
some technical reasons relating to how you start accelerat- 
ing it. When the LHC collides heavy ions, the plan is to 
use lead, which has even a slightly bigger nucleus. There’s 
nothing really special about gold from the standpoint of 
heavy ion collisions. Pll keep talking about it just because 
it’s the substance of choice at RHIC. 

Particle physicists have long been willing to smash anything 
into anything if it promises to teach them something. But past 
preference has leaned toward electrons and positrons. There’s 
a good reason for this: electrons and positrons are small and 
simple compared to atomic nuclei. There is no evidence that 
an electron is anything but a point particle. Positrons are just 
like electrons, only with positive charge. Protons are already 
much more complicated. They contain at least three quarks, 
and probably some gluons. Collectively, these constituents of 
the proton (or, equally of the neutron) are called partons: 
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each one is “part” of a proton. But a proton is more than the 
sum of its partons. The strong interactions among quarks and 
gluons inside the proton are like the cascade of virtual par- 
ticles we discussed in connection with renormalizability. Let 
me remind you how that went. A quark can emit a gluon. It 
does so in the way electrons emit photons. A gluon is like a 
photon, but not entirely. The big difference is that gluons can 
split into other gluons. They can also split into quarks, or join 
with other gluons. All this emission, splitting, and joining is 
the cascade. The particles are called “virtual” because every- 
thing is happening inside the proton. You never actually see a 
single quark or a single gluon on its own: they're always part 
of a proton, or a neutron, or some other subatomic particle. 
Physicists describe this by saying that quarks and gluons are 
confined. They cascade in and out of existence, always inside 
the confines of the proton. 

When you collide protons, one way to think of what hap- 
pens is that each one interrupts the other in the middle of its 
cascade of quarks and gluons. One thing that can happen is 
that a pair of quarks hits very hard. That is the sort of event 
on which the hopes of the LHC hang: a hard process. A lot 
of what happens, though, is that quarks and gluons interact 
more softly. “Softly” here is a relative term. The colliding 
protons are utterly destroyed when they hit. Fifty or more 
particles come out of the collision, most of them unstable. 

To get a feel for these collisions, think of a car crash where 
two cars collide head on. In order to avoid contemplating 
sad and scary things, let’s suppose there aren’t any passengers, 
just crash test dummies. I’m thinking of the cars as analogous 
to colliding protons, and the dummies as analogous to the 
quarks inside each proton. In favorable circumstances, the 
dummies are only lightly damaged even when the cars are 
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totaled. That would be like saying that quarks in one proton 
interact only softly with partons in the other one. In unfavor- 
able cases, a dummy might be badly mangled by a part of the 
car going the wrong way. That’s like a hard collision. Proton- 
proton collisions are typically a hybrid, with some relatively 
hard process plus a lot of soft junk happening around it. 

Let me hasten to add that there’s nothing harmful about oc- 
casional high-energy collisions of subatomic particles. Actu- 
ally, they're happening constantly in the Earth's atmosphere, 
as highly energetic particles rain down on us and hit some 
nucleon or other in the air. What goes on at the Tevatron, 
and what will go on at the LHC, is just a controlled version 
of something that’s been happening literally since the world 
began. Because so many collisions happen in the same place 
at particle accelerators, the environment of the collisions is 
sealed off underground. There would be a lot of radiation 
hazard to a person down there. But by comparison to nuclear 
reactors or atomic weapons, the hazard is relatively mild. 

Collisions of gold nuclei are at first blush quite similar to 
proton-proton collisions. Each nucleus is a big blob of nucle- 
ons, each one undergoing its internal cascade of partons. 
During the collision, some partons may hit each other quite 
hard, while the majority nudge each other more gently. As 
with proton-proton collisions, the gold nuclei are utterly 
destroyed. Literally thousands of particles pour forth from a 
single collision of gold nuclei. 

There is something qualitatively more catastrophic about 
collisions of gold nuclei than of protons. To describe it, let 
me return, a little wincingly, to the car crash analogy. One of 
the worst things that can happen is for one or both gas tanks 
to ignite and explode. Car makers take considerable pains to 
prevent this, for example by positioning the tank where it is 
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An ultra-high-speed collision of gold nuclei creates a quark-gluon 
plasma, which decays into thousands of energetic particles. 


least likely to be punctured. What happens in a gold-gold 
collision is a little like the gas tanks exploding shortly after 
a car collision. There is literally a thermal ball of nuclear 
fire that forms and then blows itself apart. This ball is hotter 
than anything you can imagine. A gas tank explosion might 
reach 2000 Kelvin. The center of the sun is about 16 million 
Kelvin. Similar temperatures can be reached by detonating 
a thermonuclear bomb (an H-bomb). Pretty hot, eh? Well, 
get this: The temperature believed to be attained at RHIC is 
more than 200,000 times hotter than the center of the sun. 
That really takes some thinking about. It’s way more than 
white hot: white hot is just thousands or tens of thousands of 
Kelvin. It’s blindingly, radioactively, primordially hot. Pro- 
tons and neutrons melt in this heat, releasing the quarks and 
gluons inside them. They form the quark-gluon plasma, or 
QGP, which I mentioned earlier in this chapter. 
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In proton-proton collisions, the hard processes that LHC 
physicists will sift for signs of the Higgs boson and supersym- 
metry are obscured by all the soft junk that's happening during 
the same collision. But only a little. When two quarks really 
hit hard, they tend to bounce off in entirely new directions, 
and they head out to the surrounding particle detectors pretty 
much unimpeded by the rest of the proton. In heavy ion col- 
lisions, it’s exactly the opposite: hard processes happen, but 
most of the time the resulting particles seem to get “stuck” in 
the quark-gluon plasma. The extent to which this happens is 
one of the key properties of the quark-gluon plasma. Bullets 
shot into water provide a reasonable analogy. You've probably 
seen movies where James Bond or some similar character is 
dodging bullets while underwater. The bullets are pictured as 
whizzing around him, and you see these long bubbly tracks il- 
luminated in funny ways. Well, the reality is that a bullet will 
penetrate only a few feet into the water. In physicists’ termi- 
nology, bullets in water have a stopping length of a few feet. 
One of the distinctive properties of the quark-gluon plasma is 
that it has a very short stopping length for the particles coming 
from hard processes: only a few times the size of a proton. 

A second distinctive property of the quark-gluon plasma 
is its viscosity. Considering the tremendously high density 
of the QGP, its viscosity is surprisingly small. It takes some 
explaining to understand what this means. On one hand, I 
think viscosity is a familiar concept to anyone who cooks: 
honey and molasses are viscous, water and canola oil are 
much less so. But the contrast one really wants to draw in 
heavy ion physics is between nearly free-streaming particles, 
which are deemed highly viscous, and a strongly interacting 
plasma, which is not. That may seem backwards. Nothing 
could be less viscous than free-streaming particles, right? If 
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no particle hits another, there’s no viscosity, right? Unfortu- 
nately, this is completely wrong. Something with truly small 
viscosity can form flowing layers that slip over one another. 
Water flowing over rocks does this: a layer of water very 
close to the rock moves slowly, but layers above it tumble 
quickly over the stones, lubricated in some sense by the low- 
ermost layer. What if we replaced water with steam but left 
the rocks where they were? Let’s say the steam is constrained 
to follow the streambed: perhaps we put a cover over the 
stream to trap the steam. Now, steam is a bunch of individ- 
ual water molecules that rarely bump into one another. But 
they do bump into the rocks. Unlike water, steam doesn’t 
form layers that slip easily over one another. It’s actually 
harder to get a given mass of steam to flow through a rough 
channel than to get the same mass of water to flow through 
it, because the water is self-lubricating. That's what it means 
to say that water has a lower viscosity than steam. 

Heavy ion collisions create conditions a little like the 
rocky streambed, except without the rocks or the stream. 
(Analogies have their limits!) What I mean is that you can 
tell the difference in heavy ion collisions between a water- 
like substance that’s low-viscosity—in the sense of being 
able to flow freely in slippery layers—and a steamlike sub- 
stance that’s basically just a bunch of particles that rarely hit 
one another. Surprisingly, the best understanding of the data 
comes from assuming very low-viscosity behavior. Tenta- 
tive theoretical estimates of the viscosity, based on quantum 
chromodynamics, were generally well off the mark, predict- 
ing that the quarks and gluons would behave less like water 
and more like steam than they actually do. 

The world of heavy ion physics was shaken when it was 
discovered that black hole horizons have a viscosity that is 
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comparable to the small values one needs in order to un- 
derstand heavy ion data. This discovery was made in the 
framework of the gauge/string duality, which I introduced 
in chapter 6. Subsequent developments seem to indicate that 
many aspects of heavy ion collisions have close analogies in 
gravitational systems. The gravitational systems in question 
always involve an extra dimension. It’s not like the extra di- 
mensions of string theory in its theory-of-everything guise. 
This extra dimension—what I referred to as the fifth di- 
mension in the title of this chapter—is not rolled up. It’s at 
right angles to our usual ones, and we can’t move into it in 
the usual way. What it describes is energy scale—meaning 
the characteristic energy of a physical process. By combin- 
ing the fifth dimension with the ones we know and love, 
you get a curved five-dimensional spacetime. This space- 
time encodes temperature, energy loss, and viscosity in geo- 
metrical ways. Much effort in the past several years has gone 
into deciphering how detailed a correspondence can be 
made between five-dimensional geometries and the physics 
of the quark-gluon plasma. 

In summary: The soft interactions that LHC physicists 
wish weren't there in proton-proton collisions are multiplied 
many times over in heavy ion collisions. They lead to the cre- 
ation of the quark-gluon plasma. The QGP can't be described 
very well in terms of individual particles. Its properties may in 
some ways be better understood in terms of black holes in five 
dimensions, according to the gauge/string duality. 


Black holes in the fifth dimension 


In chapter 6, I gave a brief introduction to the gauge/string 
duality. Let me recapitulate a couple of the main points. 
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A gauge theory similar to quantum chromodynamics de- 
scribes how strings attached to D3-branes interact. Their 
interactions can be made stronger or weaker by changing 
a parameter of that gauge theory. If you make the inter- 
actions very strong, then thermal states are best described 
in terms of a black hole horizon that surrounds the D3- 
branes. This horizon is hard to visualize, because it is an 
eight-dimensional hypersurface in a ten-dimensional ambi- 
ent geometry. A simplification that helps me is to think of 
the horizon as a flat three-dimensional surface that is paral- 
lel to the world we live in, but separated from it in the fifth 
dimension by a distance related to the temperature. The 
larger the temperature of this three-dimensional surface, the 
smaller the separation. This is an imperfect visualization. 
What it leaves out is that the fifth dimension is not like our 
usual four. Four-dimensional experience is like a shadow 
of five-dimensional “reality.” But unlike the shadows you 
see on a sunny day, four-dimensional experience carries no 
less information than the five-dimensional “reality” behind 
it. Four-dimensional and five-dimensional descriptions are 
really equivalent. The equivalence is subtle but precise. It’s a 
metaphor on steroids: every statement you can meaningfully 
make about four-dimensional physics has a five-dimensional 
counterpart, and vice versa—at least in principle. 

Other string dualities have a similar metaphorical quality. 
For example, if you recall, the duality between ten-dimensional 
string theory and eleven-dimensional M-theory includes an 
equivalence between DO-branes and particles moving around 
a circle. The special charm of the gauge/string duality is that 
instead of relating one abstract theory to another, in dimen- 
sions beyond anyone's capacity to visualize, one is dealing 
directly with four-dimensional physics similar to what we 
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know must describe quarks and gluons. So the equivalent ob- 
jects on the five-dimensional side of the duality assume special 
significance. Most importantly for the current discussion, the 
quark-gluon plasma created in a heavy ion collision relates to a 
black hole horizon in five dimensions. What really makes this 
analogy work is that heavy ion collisions produce high enough 
temperatures to melt the nucleons into their constituent 
quarks and gluons. Nucleons themselves are relatively difficult 
to translate into five-dimensional constructions. Individual 
quarks and gluons are also difficult. But the collective behavior 
of a strongly interacting thermal swarm of quarks and gluons is 
easy to translate: the swarm becomes the horizon. 

There is undeniably an elusive quality to the gauge/string 
duality. As well established as it is on technical grounds, it 
is just strange to have a fifth dimension that isn’t really a 
dimension like the ones we know and love. It’s there not so 
much as a physical direction, but as a concept that describes 
aspects of the physics of four dimensions. Ultimately, I’m 
not convinced that the six extra dimensions of string theory 
as a theory of everything will be more tangible than the fifth 
dimension of the gauge/string duality. 

An additional irony is that the temperature of the black 
hole is supposed to be enormous, in sharp contrast to the 
temperature of black holes that might be at the cores of gal- 
axies. Recall our rough estimate from chapter 3: a black hole 
at the core of a galaxy might have a temperature of a hun- 
dredth of a trillionth of a Kelvin. The temperature ofa black 
hole in five dimensions dual to the quark-gluon plasma is 
more like three trillion Kelvin. What makes the difference 
is the curved shape of the five-dimensional geometry. 

If we accept the picture of the thermal swarm of quarks 
and gluons as a horizon in five dimensions, then what? Well, 
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there are lots of things you can do, because the gauge/string 
duality is a computational bonanza. One of the favorite com- 
putations is the viscosity: as computed from the black hole 
geometry, shear viscosity is very small compared to the den- 
sity of the plasma, and this seems to match well with a widely 
accepted interpretation of the data. Some other computations 
have to do with energetic particles, which (as I described 
earlier) cannot penetrate very far through the plasma. This 
phenomenon has an obvious affinity with black hole physics: 
nothing can get out of a black hole. But that’s not quite the 
same thing as saying that nothing can get very far through a 
thermal medium. How should the translation go? 

There’s actually some dispute about the correct answer at 
the time I am writing this book. I'll explain only one side of 
the story, and I'll hint a little bit at what the dispute is about. 

The side of the story I'll explain hinges on the idea of the 
“QCD string.” This is such an important and well-accepted 
concept that lIl back off a little bit and explain where it 
comes from. First, let me remind you that electrons produce 
a cloud of virtual photons. These photons can be described 
in terms of an electric field. Actually, any charged object 
produces an electric field. For example, a proton does. The 
electric field surrounding a proton tells other protons which 
way to move in response to the first one. Protons repel one 
another electrically. The electric field represents that by 
pointing outward, all around the proton. Protons attract 
electrons, and this is described by the same electric field: 
it’s just that electrons, being negatively charged, perceive a 
given electric field in the opposite way that protons do. 

Quarks are profoundly similar to electrons, yet also pro- 
foundly different. They produce a cloud of virtual gluons 
that can be understood as a “chromo-electric field," which 
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Top: the electric field ofa proton points radially outward. Bottom: the 
chromo-electric field generated by a quark forms into a QCD string 
that can end on an anti-quark. 


tells other quarks which way to move. So far, this is very 
similar to electrons. But the virtual gluons interact strongly 
with one another, which is wholly unlike photons. Because 
of these interactions, the chromo-electric field channels it- 
self into a narrow string—the QCD string—which stretches 
from one quark to another. There are objects called mesons 
that are understood to be well described in these terms: two 
quarks linked by a QCD string. By studying the properties 
of mesons, you can infer some of the dynamics of the QCD 
string, and it is in some ways quite similar to the strings of 
string theory. In fact, such studies are older than QCD or 
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string theory! They provided the first fodder for speculation 
that strings could describe aspects of subatomic physics. The 
modern incarnation of those speculations is one aspect of the 
gauge/string duality and its relation to QCD. What’s really 
different between modern string theory and QCD is that 
the string is regarded as the fundamental object, whereas 
the QCD string is a collective effect of many virtual glu- 
ons. However, a lesson of string dualities is not to hold too 
rigidly to one theoretical construction as fundamental and 
another as derived: as circumstances vary, so too may the 
most convenient language to describe reality. 

Now imagine a quark that was produced in a hard pro- 
cess and is plowing through the quark-gluon plasma, like a 
bullet plowing through water. The ideas behind the QCD 
string should still have some currency: the quark surrounds 
itself with virtual gluons, these gluons interact among 
themselves, and some collective tendency toward forming 
a QCD string might arise. But there’s something else going 
on: all the quarks and gluons in the thermal swarm are in- 
teracting with the original quark, as well as with any vir- 
tual gluons it might produce. This thermal swarm prevents 
the QCD string from fully forming. Altogether, the picture 
might look a little bit like a tadpole: the original quark 
is like the head, and its attempts to make a QCD string 
are the tail. The way the tail wiggles and flails through 
the water is like how the thermal swarm interacts with the 
virtual gluons. This picture has not been made precise or 
quantitative (as far as I know) in QCD itself. But there is 
something similar to it in the gauge/string duality. A string 
dangles down from a quark into the black hole horizon. As 
the quark moves forward, the string is pulled forward. But 
the end that it trails into the black hole horizon is in some 
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A quark moving through a thermal medium like the quark-gluon 
plasma trails a string down into the fifth dimension, where it eventually 
crosses through a black hole horizon. When the quark moves, the 
string trails behind it and produces a drag force on the quark. 


sense stuck. The string pulls back on the quark because it 
can't get its other end free of the black hole. Eventually, the 
quark either gives up and stops moving, or falls into the 
black hole itself. Either way, it doesn't get very far. 

The picture I just described is supposed to work best for 
heavy quarks. Examples of heavy quarks are the charm quark, 
with a mass about 50% more than a proton, and a bottom 
quark, with a mass more than four times as much as a proton. 
Such quarks are almost entirely absent from normal matter, 
but they are produced in heavy ion collisions. The "ordinary 
quarks" in normal matter, together with anti-quarks of the 
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same masses, are produced in heavy ion collisions far more 
abundantly than heavy quarks. There are attempts to extend 
the picture of a quark trailing a string to the case of ordinary 
quarks, but they are somewhat tentative so far. 

The bottom line is that the gauge/string duality provides 
an estimate of how far a heavy quark propagates through a 
thermal medium similar to the quark-gluon plasma. With 
such an estimate in hand, the next task is to find out whether 
it agrees with data. 

There are two reasons why this is tricky. First, experimen- 
talists can’t train a microscope on the quark-gluon plasma and 
watch as a heavy quark trundles along and then stops. Instead, 
their little ball of plasma, including the heavy quark, com- 
pletely blows itself apart in a time comparable to the time it 
takes light to traverse a gold nucleus. That’s a very, very short 
time: about 4 x 10? seconds, which is forty trillionths of a 
trillionth of a second. The only things they get to look at are 
the thousands of particles that come out. It's pretty impressive 
that they can infer how the charm quark interacted with the 
medium from inspecting the debris. I think experimentalists 
would caution that such inferences have to be taken with a 
grain of salt. They can be 99.99% confident of their measure- 
ments, and yet be considerably less certain about how far the 
average charm quark gets through the plasma. 

The second reason why it is tricky to compare a predic- 
tion of the gauge/string duality with data is that the string 
theory computations apply to a theory that is only similar to 
QCD, not to QCD itself. The theorist has to make some 
translation between one and the other before he or she has 
a definite prediction to give an experimentalist. In other 
words, there's some fudge. The best attempts to handle this 
translation honestly lead to predictions for the charm quark's 
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stopping distance that are either in approximate agreement 
with data, or perhaps as much as a factor of 2 smaller. A sim- 
ilar comparison can be made for viscosity, and the upshot is 
that the gauge/string duality produces a result that is either 
in approximate agreement with data, or perhaps a factor of 
2 away from agreement. 

All this doesn’t sound like a reason to break out the cham- 
pagne. And yet, agreement between modern string theory 
and modern experiment to within a factor of 2 is a tremen- 
dous novelty for high-energy physics. Fifteen years ago, all 
the string theorists were toiling away in extra dimensions, 
and all the heavy ion experimentalists were busy building 
their big detectors. None of us were even able to imagine 
the kind of calculations I’ve described. Now we study each 
other’s papers, we go to the same conferences, we worry 
about factors of 2, and we try to figure out what to do next. 
That’s progress. 

Earlier, I mentioned a dispute about how to translate 
the stopping of an energetic quark into a process involving 
strings and black holes. This controversy is not about fac- 
tors of 2 here or there. Instead, it’s about the physical picture 
one should have in mind in describing the energetic quark. 
The picture I’ve described involves a string trailing out from 
the quark, down into the fifth dimension, and into the black 
hole horizon. The competing picture is more abstract, but it 
essentially relies on a U-shaped string configuration where 
the bottom of the U just grazes the horizon. For lack of bet- 
ter terminology, I'll refer to the two pictures as the trail- 
ing string and the U-shaped string. A virtue of the latter is 
that it purports to describe ordinary quarks. That’s good be- 
cause they’re far more abundant, hence easier to study. The 
U-shaped string leads to predictions about quark energy loss 
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that are once again either on target or within about a factor 
of 2. The trouble is that the “fudge factors” for U-shaped 
strings and for trailing strings tend to get chosen differently. 
Furthermore, proponents of each picture have made specific 
criticisms of the other. This is not an easy debate to settle. 
The questions are abstract, the hypotheses are a little differ- 
ent, and the agreement with data is only expected to be ap- 
proximate. Still, I would take it as a sign of new-found health 
that string theorists are debating the relative merits of calcu- 
lations that can be compared at least approximately with data. 

What is the future? For heavy ion collisions, I think the 
answer is that more is better. The more calculations that 
string theorists can perform, the more handles they should 
have on the difficult problem of translation. The aim is a 
reasonably coherent and consistent mapping between five- 
dimensional constructions and experimentally measurable 
quantities. It could be that this program hits a roadblock at 
some point: there could just be some insuperable difference 
between the string theory constructions and real world 
QCD. So far that hasn’t happened. It could also happen 
that the string theory calculations peter out because of in- 
sufficient ability to handle the technical difficulties. String 
theory does seem to go in fits and starts: a lot of progress, 
then relative stagnation, then more progress. 

Experiments at the LHC will include slamming lead nu- 
clei together at substantially higher energies than RHIC 
can reach. (Remember, for purposes of heavy ion collisions, 
lead and gold are nearly identical.) The data from these col- 
lisions should provide a large new stimulus to theoretical 
approaches—related or unrelated to string theory. Among 
the many advances we can expect, heavy ion collisions at the 
LHC will produce heavy quarks in much greater abundance 
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than the ones at RHIC. Moreover, the detectors at the LHC 
are more advanced than the ones at RHIC. So it’s reasonable 
to hope that greater clarity on the physical picture of energy 
loss from fast-moving quarks will emerge from the LHC. 

It is fair to say, though, that the main suspense surround- 
ing the LHC, is: What new particles will it discover? What 
new symmetries? Proton-proton collisions are far and away a 
better setting for such discoveries than heavy ion collisions, 
both because the energies per proton are higher and because 
the environment is less noisy. Prognosticating LHC discov- 
eries is, naturally, more than a hobby among theorists. By the 
time you read this book, you probably will know more than 
I do now. But IIl hazard this guess: unless we're lucky, the 
discoveries won't flash like lightning bolts across the sky. The 
experiments are hard, the theories are abstract, and match- 
ing the two may well involve difficulties and controversies 
that are sharper than the ones I’ve described in this chapter. 
Even if a few discoveries are made right away, fitting every- 
thing into a coherent picture is probably going to be a long 
and confusing process. Because of its achievements to date, 
because of its rich mathematical structure, and because of the 
way it twines through such a broad swath of other theoretical 
ideas, from quantum mechanics to gauge theory to gravity, 
I expect that string theory will be a crucial part of the final 
answer. 
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THERE ARE MANY ASPECTS OF STRING THEORY WE COULD 
ponder after the tour of the subject we have just completed. 
We could ponder the peculiar demands it seems to make 
on spacetime, like ten dimensions and supersymmetry. We 
could ponder the peculiar objects whose existence it requires, 
everything from DO-branes to end-of-the-world branes. We 
could ponder its tenuous but improving connection to ex- 
perimental physics. We could also weigh the controversy that 
it generates: Is string theory worthwhile? Overhyped? Un- 
duly maligned? 

As fascinating as all these topics are, the topic that I think 
is most worth ending with is the mathematics that forms 
the core of string theory. Readers of my generation may 
remember the Wendy's commercial where the short grey- 
haired lady demands, "Where's the beef?" Well, in string 
theory, the beef is in the equations. Almost all the equations 
of string theory involve calculus, which puts them out of 
reach of a popular account. So I’ve tried to take a handful 
of important equations, roughly following the topics we've 
covered in chapters 5 through 8, and put them into words. 

The most basic formula in string theory is the equation 
for how strings prefer to move. What this equation says is 
that strings try to move through spacetime in such a way 
that the area of the surface they sweep out is minimized. 
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This motion takes no account of quantum mechanics. 
There is another equation—really a group of equations— 
that explains how to include quantum mechanics in the 
motion of strings. These equations say that any motion of 
the string is possible, but that motions that are only slightly 
different from the area-minimizing motion reinforce one 
another. What I mean by "reinforcing" is illustrated by a 
Roman fasces: a bundle of sticks all lined up with one an- 
other. Such a bundle is very strong, much stronger than 
each stick. Each possible motion of a string is like a single 
stick. Most are scattered in a disorganized way. But motions 
of a string that are close to the area-minimizing motion are 
"aligned" in a way that makes them dominate the equations 
describing the quantum mechanics of strings. 

The equations describing D-branes are variants of the 
ones that describe strings. Their most distinctive feature is 
that when many D-branes are grouped close together (again 
like a Roman fasces), they have more ways of moving than 
there are dimensions of spacetime. Whenever the D-branes 
separate significantly, ten-dimensional spacetime describes 
their relative positions. But when the D-branes are close 
enough, it takes a gauge theory to describe their motion. 
The equations of gauge theory say that strings stretching 
between pairs of branes, like I drew on p. 90, can't reliably 
be said to go from a “red” brane to a “blue” brane, or from 
"green" to "red." Instead, all such possibilities can be super- 
posed in a single colorful wave function, the way melody 
and harmony in the Fantasie-Impromptu combine without 
losing their separate identities. 

The equations of string dualities have a remarkably frag- 
mentary quality. The ones that enter at the level of supergravity 
are surprisingly simple, usually expressing some symmetry re- 
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lation. The ones describing strings and branes are quantum 
mechanical, but still pretty simple: the most common sort of 
equation in this context says that the electric charges (or ana- 
logs of electric charges) of branes must take on integer values in 
appropriate units. There are many, many further equations in 
string dualities, commonly arising from carefully tracing out 
how the intuitive relations we have discussed can be rendered 
quantitative. An example is the calculation of how the quan- 
tum fluctuations of a clump of DO-branes contribute to the 
mass of the clump. The answer—that they don’t contribute at 
all—was anticipated based on the duality with M-theory long 
before it was demonstrated conclusively with equations. 

The equations of supersymmetry start with relations like 
a X a= 0. This equation has several meanings. It means that 
there are only two states of motion in a fermionic dimension: 
moving or stopped. It also means that two fermions cant 
occupy the same state (the exclusion principle), as we dis- 
cussed for electrons in a helium atom. Supersymmetry goes 
on from simple relations like a x a — 0 to truly profound 
equations that have helped shape modern mathematics. 

The equations describing black holes and the gauge/string 
duality come mostly in two varieties. The first sort of equa- 
tion is a differential equation. These equations describe the 
detailed moment-to-moment behavior of a string or a par- 
ticle in spacetime, or of spacetime itself. The second sort of 
equation has a much more global flavor. You describe what 
is happening in a whole swath of spacetime, all in one go. 
The two types of equations are often closely related. For ex- 
ample, there's a differential equation that basically amounts to 
a particle saying, “I’m falling!” And there's a global equation 
describing a black hole horizon that basically says, “Fall across 
this line and you'll never make it back out." 
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As important as mathematics is to string theory, it would 
be a mistake to regard string theory as just a big collection 
of equations. Equations are like brushstrokes in a painting. 
Without brushstrokes there would be no painting, but a 
painting is more than a big collection of brushstrokes. With- 
out a doubt, string theory is an unfinished canvas. The big 
question is, when the blanks get filled in, will the resulting 
picture reveal the world? 
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5-6, 141, 145-55; racecar analogy 
and, 63-68, 97; renormalization 
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5—6, 141—43, 145—55; Relativistic 
Heavy Ion Collider (RHIC) and, 
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any value of f,, we need to allow for the detailed temperature dependence of the 
axion mass. Using our formula for the axion potential as a function of tempera- 
ture we can ask when the associated mass becomes of order the Hubble constant. 
After that time, axion oscillations are more rapid than the Hubble expansion, so 
we might expect that the axion density will damp, subsequently, like matter. Let’s 
take, specifically, f, = 10!! GeV. For the axion mass, we can take: 
Agcp \*" 

mT) ¥ 0.1m,(T = 0) (A22) : (19.58) 
The axion then starts to oscillate when T ~ 1.5 GeV. At this time, it represents 
about 10~° of the energy density. One needs to do a bit more work to show that in 
the subsequent evolution, the energy in axions relative to the energy in radiation falls 
roughly as 1/7, but for this decay constant, the axion and radiation energies become 
equal at roughly 1 eV. If the decay constant is significantly higher than 10!! GeV, 
then the axions start to oscillate too late, and dominate the energy density too early. 
If the decay constant is significantly smaller, then the axions cannot constitute the 
presently observed dark matter. 

So it is remarkable that there is a rather narrow range of axion decay constants 
consistent with observation. On the other hand, some of the assumptions we have 
made in this section are open to question. In the case of hybrid inflation we have 
seen that the universe might never have been much hotter than 10 MeV. In this case, 
the upper limit on the axion decay constant, as we will discuss further later, can be 
much weaker. 


19.3 The LSP as the dark matter 


A stable particle is not necessarily a good dark matter candidate. But we can make 
a crude calculation which indicates that the LSP density is in a suitable range to 
be the dark matter. Consider particles, X, with mass of order 100 GeV interacting 
with weak interaction strength. Their annihilation and production cross sections go 
as G2E >. So, in the early universe, the corresponding interaction rate is of order 


T © pyGpE* © pyGPT”. (19.59) 


These interactions will drop out of equilibrium when the mass of the particle X is 
small compared to the temperature, so that there is a large Boltzmann suppression 
of their production. This will occur when this rate is of order the expansion rate, or 
T2 
T3e MXIT (yo) ~ —_., (19.60) 
P 
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Since the exponent is very small, once T ~ 10Mx, we can get a rough estimate of 
the density by saying that: 


e Mx/TT3 ~ Ge? /Mp. (19.61) 
The ratio, nx/s, then, is of order 
nx/s © Gz? /(M,T?). (19.62) 


Assuming that My ~ 100 GeV, and T ~ 10GeV, this gives about 10~° for the 
right-hand side. Since the energy density in radiation damps like T~*, while that 
for matter damps like T~*, this gives matter-radiation equality at temperatures of 
order an electronvolt, as in the standard big bang cosmology. 

Needless to say, this calculation is quite crude. Extensive, far more sophisticated, 
calculations have been done to find the regions of parameter space in different su- 
persymmetric models which are compatible with the observed dark matter density. 
The basic starting point for these analyses is the Boltzmann equation. If the basic 
process is of the form 1 + 2 = 3 + 4, then: 


dpi d? py d? p3 d? pa 
(27)32E, J 2x)32E,) Qny2E;) 2nx)32Ey 


d 
a *—(nia’) = 


(AAU + fll + fl — AAU + AL + fAQx)*8(r1 + po — ps — pa)lM?. 

(19.63) 
The functions f|,..., f4 are the distribution functions for the different species. 
These equations can be simplified in the high-temperature limit using Boltzmann 
statistics: 


f(E) > e/TeH F/T, (19.64) 


Interactions are still fast enough at this time to maintain equilibrium of the X 
momentum distributions (kinetic equilibrium) but not that of X number. So it is the 
limiting value of the X chemical potential, jz x, which we seek. In this limit, we have: 


Af = Alls fhl-ffll+ AlN = fal 


a ge rit (liar iayle = elstHa/P) (19.65) 


Here we have used FE; + Ey = E34 E4. 

Things simplify further as all but the X particle (particle 1) are light, and nearly in 
equilibrium. Defining n® as the distributions in the absence of a chemical potential, 
and defining the thermally averaged cross section: 


1 d*p, d> po d>p3 d>p4 2 
ORO) 3 3 3 aaa 

ny Ns (27)?2E, (27)?2E, J (2m)?2E3 J (27)?2E4 
(19.66) 
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Fig. 19.3. Reprinted from J. R. Ellis et al., Supersymmetric dark matter, Phys. Lett. 
B, 565, 176 (2003), Figure 9. Copyright 2003, with permission from Elsevier. 


we have 


3 

ee = nO ny(ov) ( = “) (19.67) 
x 

Detailed solutions of these equations (often without some of these simplifications) 

reveal, as one would expect, a range of parameters in the MSSM compatible with 

the observed dark matter density (Fig 19.3). 

So while it is disturbing that we need to impose additional symmetries in the 
MSSM in order to avoid proton decay, it is also exciting that this leads to a possible 
solution of one of the most critical problems of cosmology: the identity of the dark 
matter. 


19.4 The moduli problem 


We have seen that, in supersymmetric theories, there are frequently light moduli. 
In string models, we will find that such fields are ubiquitous. Such moduli, if they 
exist, pose a cosmological problem with some resemblance to the problems of axion 
cosmology. 

In this section, we will formulate the problem as it arises in gravity-mediated 
supersymmetry breaking. The potential for the modulus, ¢, would be expected to 
take the form: 


V(p) = m3.M5 f(G/Mp). (19.68) 
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By assumption, f has a minimum at some value, ¢, of order M,. In the early 
universe, when the Hubble parameter is much greater than m3,/2, this potential is 
effectively quite small, and there is in general no obvious reason that the field 
should sit at its minimum. So, when H ~ m3), the field is likely to lie at a distance 
M,, in field space from the minimum, and store an energy of order m; M3. Like 
the axion, after this time, assuming it is within the domain of attraction of the 
minimum, it will oscillate, behaving like pressureless dust. Almost immediately, 
given our assumptions about scales, it comes to dominate the energy density of the 
universe, and it continues to do so until it decays. The problem is that the decay 
occurs quite late, and the temperature after the decay is likely to be quite low. 

We can estimate the temperature after @ decay, T,, by considering first the lifetime 
of the @ particle. We might expect this to be: 


3 
3/9 


iS 
2 
M, 


(19.69) 
assuming that the couplings of the ¢ field to other light fields are suppressed by 
a single power of M@. Assuming that the decay products quickly thermalize, and 
noting that I°y is the Hubble constant at the time of @ decay, gives 

T4 a m3 9 


r 


2 4 
Ms (10 keV)". (19.70) 
Here we are assuming 3/2 ~ 1 TeV. This is a temperature well below nucleosyn- 
thesis temperature. So in such a picture, the universe is matter dominated during 
nucleosynthesis. But the situation is actually far worse: the decay products almost 
certainly destroy deuterium and the other light nuclei. 

Two plausible resolutions for this puzzle have been put forward (apart from the 
obvious one that perhaps there is no supersymmetry or no moduli): the moduli 
might be significantly more massive than 1 TeV. Note that T, scales like the moduli 
masses to the 3/2 power, so if the moduli masses are of order hundreds of TeV, this 
temperature can be sufficiently high that nucleosynthesis occurs (again). There is, 
potentially, a serious problem with such a picture. During the @ decay, significant 
entropy is produced. For example, if the temperature of the universe when the 
moduli came to dominate was \/m3/2M,, when the moduli decay it is far smaller 
than 7,. Correspondingly, the entropy is increased by a factor which can easily 
be 10°-10!*. Any baryon number produced before these decays is diluted by a 
corresponding factor. One can hope that the baryons are produced in the decays of 
these moduli, but this requires one understand why such low-energy baryon-number 
violation doesn’t cause difficulties for proton decay. In the next section, we will 
discuss possible mechanisms to produce the baryon asymmetry, and we will see 
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that there is one which is capable of producing a large enough asymmetry to survive 
moduli decays. 


19.5 Baryogenesis 


The baryon to photon ratio, ng/n,, 1s quite small. At early times, when QCD was 
nearly a free theory, this slight excess would have been extremely unimportant. But 
for the structure of our present universe, it is terribly important. One might imagine 
that ng/n, is simply an initial condition, but it would be more satisfying if we 
could have some microphysical understanding of this asymmetry between matter 
and antimatter. Andrei Sakharov, after the experimental discovery of CP violation, 
was the first to state precisely the conditions under which the laws of physics could 
lead to a prediction for the asymmetry. 


(1) The underlying laws must violate baryon number. This one is obvious; if there is, for 
example, no net baryon number initially, and if baryon number is not conserved, the 
baryon number will always be zero. 

(2) The laws of nature must violate CP. Otherwise, for every particle produced in interac- 
tions, an antiparticle will be produced as well. 

(3) The universe, in its history, must have experienced a departure from thermal equilibrium. 
Otherwise, the CPT theorem insures that the numbers of baryons and anti-baryons at 
equilibrium are zero. This can be proven with various levels of rigor, but one way 
to understand this is to observe that CPT insures that the masses of the baryons and 
anti-baryons are identical, so at equilibrium their distributions should be the same. 


Subsequently, there have been many proposals for how the asymmetry might 
arise. In the next sections, we will describe several. Leptogenesis relies on lepton- 
number violation, something we know is true of nature, but of whose underly- 
ing microphysics we are ignorant. Baryogenesis through coherent scalar fields 
(Affleck—Dine baryogenesis) also seems plausible. It is only operative if supersym- 
metry is unbroken to comparatively low energies, but it can operate quite late in the 
evolution of the universe and can be extremely efficient. This could be important in 
situations like moduli decay or hybrid inflation where the entropy of the universe 
is produced very late, after the baryon number. 


19.5.1 Baryogenesis through heavy particle decays 


One well-motivated framework in which to consider baryogenesis is grand unifica- 
tion. Here one can satisfy all of the requirements for baryogenesis. Baryon-number 
violation is one of the hallmarks of GUTs, and these models possess various sources 
of CP violation. As far as departure from equilibrium, the massive gauge bosons, 
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Fig. 19.4. Tree and loop diagrams whose interference can lead to an asymmetry 
in heavy particle decay. 


X, provide good candidates for a mechanism. To understand in a bit more detail 
how the asymmetry can come about, note that CPT requires that the total decay 
rate of X is the same as that of its antiparticle X. But it does not require equality of 
the decays to particular final states (partial widths). So starting with equal numbers 
of X and X particles, there can be a slight asymmetry between processes such as 


X > dL;X > Oi (19.71) 
and 
X > dLl:X > Qu. (19.72) 


The tree graphs for these processes are necessarily equal; any CP-violating phase 
simply cancels out when we take the absolute square of the amplitude (see Fig. 19.4). 
This is not true in higher order, where additional phases associated with real in- 
termediate states can appear. Actually computing the baryon asymmetry requires 
an analysis of the Boltzmann equations, of the kind he have encountered in our 
discussion of dark matter. 

There are reasons to believe, however, that GUT baryogenesis is not the origin 
of the observed baryon asymmetry. Perhaps the most compelling of these has to 
do with inflation. Assuming that there was a period of inflation, any pre-existing 
baryon number was greatly diluted. So in order that one produces baryons through X 
boson decay, it is necessary that the reheating temperature after inflation be at least 
comparable to the X boson mass. But as we have explained, a reheating temperature 
greater than 10? GeV leads to cosmological difficulties, especially overproduction 
of gravitinos. 


19.5.2 Electroweak baryogenesis 


The Standard Model, for some range of parameters, can satisfy all of the conditions 
for baryogenesis. We have seen in our discussion of instantons that the Standard 
Model violates baryon number. This violation, we saw, is extremely small at low 
temperatures, so small that it is unlikely that a single baryon has decayed in the his- 
tory of the universe this way. The rate is so small because baryon-number violation 
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is a tunneling process. If one could excite the system to high energies, one might 
expect that the rate would be enhanced. At high enough energies, the system might 
simply be above the barrier. One can find the configuration which corresponds to 
sitting on top of the barrier by looking for static, unstable solutions of the equations 
of motion. Such a solution is known. It is called a sphaleron (from Greek, meaning 
“ready to fall’). The barrier is quite high — from familiar scaling arguments, the 
sphaleron energy is of order E,, = 1/aMy. But this configuration is large com- 
pared to its energy; it has size of order My. As a result, it is difficult to produce in 
high-energy scattering. Two particles with enough energy to produce the sphaleron 
have momenta much higher than My. As a result, their overlap with the sphaleron 
configuration is exponentially suppressed. 

Athigh temperatures one might expect that the sphaleron rate would be controlled 
by a Boltzmann factor, e~»/" . So as the temperature increases, the rate would grow 
significantly. This turns out to be the case. In fact, the rate is even larger than one 
might expect from this estimate, because F,, itself is a function of T. At very high 
temperatures, the rate has no exponential suppression at all, and behaves as: 


Tl =(a,T7)*. (19.73) 


These phenomena are discussed in Appendix C. 

If the Higgs mass is not too large, the Standard Model can produce a significant 
departure from equilibrium. As one raises the temperature, a simple calculation, 
described in Appendix C, shows that the Higgs mass increases (the mass-squared 
becomes less negative) with temperature. At very high temperature, the SU(2) x 
U(1) symmetry is restored. The phase transition between these two phases, for a 
sufficiently light Higgs, is first order. It proceeds by the formation of bubbles of 
the unbroken phase. The surfaces of these bubbles can be sites for baryon number 
production. These phenomena are also discussed in Appendix C. So the third of 
Sakharov’s conditions can be satisfied. 

Finally, we know that the Standard Model violates CP. We also know, however, 
that it is crucial that there are three generations, and that this CP violation vanishes 
if any of the quark masses are zero. As a result, even if the Higgs mass is small 
enough that the transition is strongly first order, any baryon number produced is 
suppressed by several powers of Yukawa couplings, and is far too small to account 
for the observed matter—antimatter asymmetry. 

In the MSSM, the situation is somewhat better. There is a larger region of the 
parameter space in which the transition is first order, and as we have seen, there 
are many new sources of CP violation. As a result, there is, as of this writing, a 
small range of parameters where the observed asymmetry could be produced in this 
way. 


290 19 Astroparticle physics and inflation 


19.5.3 Leptogenesis 


There is compelling evidence that neutrinos have mass. The most economical ex- 
planation of these masses is that they arise from a seesaw, involving gauge singlet 
fermions, N,. These couplings violate lepton number. So according to Sakharov’s 
principles, we might hope to produce a lepton asymmetry in their decays. Because 
the electroweak interactions violate baryon and lepton number at high temperatures, 
the production of a lepton number leads to the production of baryon number. 

In general, there may be several N, fields, with couplings of the form: 


Lw = MapNaNp t+ hai LiNat c.c. (19.74) 


In a model with three Ns, there are CP-violating phases in the Yukawa couplings 
of the Ns to the light Higgs. The heaviest of the right-handed neutrinos, say N1, 
can decay to ¢ and a Higgs, or to @ and a Higgs. At tree level, as in the case of GUT 
baryogenesis, the rates for production of leptons and anti-leptons are equal, even 
though there are CP-violating phases in the couplings. It is necessary, again, to look 
at quantum corrections, in which dynamical phases can appear in the amplitudes. 
At one loop, the decay amplitude for N has a discontinuity associated with the 
fact that the intermediate N, and N> can be on shell (similar to Fig. 19.4). So one 
obtains an asymmetry proportional to the imaginary parts of the Yukawa couplings 
of the Ns to the Higgs: 


T(N 0H) —T(N CH. 12 M2 
fe (N, > €H) (Ni > e 12) 7 3 Im[(h hi) Ff (75). 
P(N, > €Hr) + P(N > An) 8 hhi 4H, Mi? 


(19.75) 


where f is a function that represents radiative corrections. For example, in the Stan- 
dard Model f = /x[(x — 2)/@ — 1) +(« + 1) Ind + 1/x)], while in the MSSM 
f = Jx[2/(x — 1)+In(1 + 1/x)]. Here we have allowed for the possibility of 
multiple Higgs fields, with Hy coupling to the leptons. The rough order of mag- 
nitude here is readily understood by simply counting loops factors. It need not be 
terribly small. 

Now, as the universe cools through temperatures of order of masses of the Ns, 
they drop out of equilibrium, and their decays can lead to an excess of neutrinos 
over antineutrinos. Detailed predictions can be obtained by integrating a suitable 
set of Boltzmann equations. But a rough estimate can be obtained by noting that 
the N,s drop out of equilibrium when their production rate becomes comparable 
to the expansion rate of the universe. If w represents a typical coupling, this occurs 
roughly when 


morTe MN/T ~ —_, (19.76) 
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Assuming that in the polynomial terms, T ~ My//10, gives that the density at this 
time is of order 


PN nT 


: (19.77) 
Prot Mpa? 
Multiplying by €, the average asymmetry in N decays, this estimate suggests a 
lepton number — and hence a baryon number -— of order: 
PB Mn 


x ; 19.78 
Prot 10207M, ( ) 


We have seen that € is suppressed by a loop factor and by Yukawa couplings. So 
this number can easily be compatible with observations, or even somewhat larger, 
depending on a variety of unknown parameters. 

These decays, then, produce a net lepton number, but not baryon number 
(and hence a net B — L). The resulting lepton number will be further processed 
by sphaleron interactions, yielding a net lepton and baryon number (recall that 
sphaleron interactions preserve B — L, but violate B and L separately). One can 
determine the resulting asymmetry by an elementary thermodynamics exercise. 
One introduces chemical potentials for each neutrino, quark and charged lepton 
species. One then considers the various interactions between the species at equi- 
librium. For any allowed chemical reaction, the sum of the chemical potentials on 
each side of the reaction must be equal. For neutrinos, the relations come from: 


(1) the sphaleron interactions themselves 


D> (34a, + He,) = 0; (19.79) 


uj 


(2) a similar relation for QCD sphalerons 


Yo (2a, — Hu, — Ha) = 0; (19.80) 


L 


(3) vanishing of the total hypercharge of the universe 


2 
Do (Hay = 2a + Hay — Me, + Ha,) + Hen = 05 (19.81) 


U 


(4) the quark and lepton Yukawa couplings give relations 
Hg, — Md — ba; =9, Mg — he — Ma; =9, Me, — he — Me; =9. = (19.82) 


The number of equations here is the same as the number of unknowns. Combining 
these, one can solve for the chemical potentials in terms of the lepton chemical 
potential, and finally in terms of the initial B — L. With N generations, 


8N +4 
pes os. (19.83) 
QIN +13 
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Reasonable values of the neutrino parameters give asymmetries of the order we 
seek to explain. Note sources of small numbers. 


(1) The phases in the couplings. 

(2) The loop factor. 

(3) The small density of the N particles when they drop out of equilibrium. Parametrically, 
one has, e.g., for production, 


Pee eT (19.84) 


which is much less than H ~ T*/M,p once the density is suppressed by 7'/ Mp, i.e. of 
order 10~° for a 10! GeV particle. 


It is interesting to ask: assuming that these processes are the source of the ob- 
served asymmetry, how many parameters which enter into the computation can 
be measured? It is likely that, over time, many of the parameters of the light neu- 
trino mass matrices, including possible CP-violating phases, will be measured. But 
while these measurements determine some of the N; couplings and masses, they 
are not, in general, enough. In order to give a precise calculation, analogous to 
the calculations of nucleosynthesis, of the baryon number density, one needs ad- 
ditional information about the masses of the fields N;. One either requires some 
other (currently unforeseen) experimental access to this higher-scale physics, or 
a compelling theory of neutrino mass in which symmetries, perhaps, reduce the 
number of parameters. 


19.5.4 Baryogenesis through coherent scalar fields 


In supersymmetric theories, the ordinary quarks and leptons are accompanied by 
scalar fields. These scalar fields carry baryon and lepton number. A coherent field, 
i.e. a large classical value of such a field, can in principle carry a large amount of 
baryon number. As we will see, it is quite plausible that such fields were excited in 
the early universe, and this can lead to a baryon asymmetry. 

To understand the basics of the mechanism, consider first a model with a single 
complex scalar field. Take the Lagrangian to be 


L= |d,6 — me). (19.85) 


This Lagrangian has a symmetry, ¢ — e'“@, and a corresponding conserved cur- 
rent, which we will refer to as baryon number: 


jp = i(6"0"o — ga"'g"). (19.86) 
It also possesses a “CP” symmetry: 
oo. (19.87) 


With supersymmetry in mind, we will think of m as of order My. 
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If we focus on the behavior of spatially constant fields, #(x, t) = (ft), this system 
is equivalent to an isotropic harmonic oscillator in two dimensions. In field theory, 
however, we expect that higher-dimension terms will break the symmetry. In the 
isotropic oscillator analogy, this corresponds to anharmonic terms which break 
the rotational invariance. With a general initial condition, the system will develop 
some non-zero angular momentum. If the motion is damped, so that the amplitude 
of the oscillations decreases, these rotationally non-invariant terms will become 
less important with time. 

In the supersymmetric field theories of interest, supersymmetry will be broken by 
small quartic and higher-order couplings, as well as by soft masses for the scalars. 
So as a simple model, take: 


Lr=Alolt +e @o* +o d+ +c. (19.88) 


These interactions clearly violate “B.’ For general complex € and o, they also 
violate CP. As we will shortly see, once supersymmetry is broken, quartic and 
higher-order couplings can be generated, but these couplings A,€,o0... will be 
extremely small, O(m3./M;) or O(m3>/Mz.,.)- 

In order that these tiny couplings lead to an appreciable baryon number, it is 
necessary that the fields, at some stage, were very large. To see how the cosmic 
evolution of this system can lead to a non-zero baryon number, first note that at very 
early times, when the Hubble constant, H >> m, the mass of the field is irrelevant. 
It is thus reasonable to suppose that at this early time @ = ¢o > 0; later we will 
describe some specific suggestions as to how this might come about. This system 
then evolves like the axion or moduli. In the the radiation and matter dominated 
eras, respectively, one has that 


oe ne sin(mt) (radiation) (19.89) 
po. 
= Cnt) sin(mt) (matter). (19.90) 
In either case, the energy behaves, in terms of the scale factor, R(t), as 
E ~ m>¢2 ee (19.91) 


i.e. it decreases like R*, as would the energy of pressureless dust. One can think of 
this oscillating field as a coherent state of ¢ particles with p = 0. 

Now let’s consider the effects of the quartic couplings. Since the field amplitude 
damps with time, their significance will decrease with time. Suppose, initially, that 
& = 0 is real. Then the imaginary part of @ satisfies, in the approximation that € 
and 6 are small, 


b; + 3Hd; +m; © Im(e + 4)¢?. (19.92) 
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For large times, the right-hand side falls as t~°/*, whereas the left-hand side falls 
off only as t~*/*. As a result, just as in our mechanical analogy, baryon number 
(angular momentum) violation becomes negligible. The equation goes over to the 
free equation, with a solution of the form 


Im(€ + 5)¢3 

m2(mt)>/4 

Im(e + 5)¢3 
mt 


oj = a sin(mt + 6,) (radiation), (19.93) 


di = an sin(mt + dm) (matter), (19.94) 


in the radiation and matter dominated cases, respectively. The constants dn, 6;, dm 
and a, can easily be obtained numerically, and are of order unity: 


ad; =0.85 dy =0.85 6,=—0.91 8, = 1.54. (19.95) 


But now we have a non-zero baryon number; substituting in the expression for the 
current, 


2 
ng = 2a,Im(e + 4) %% sin(6, + 2/8) (radiation) (19.96) 
m(mt)2 
2 
Np = 2dmlm(e + 4) %% sin(Sm) (matter). (19.97) 
mnt)? 


Note that CP violation here can be provided by phases in the couplings and/or the 
initial fields. Note also as expected, ng is conserved at late times, in the sense that 
the baryon number per comoving volume is constant. 

This mechanism for generating baryon number could be considered without 
supersymmetry. In that case, it begs several questions. 


e What are the scalar fields carrying baryon number? 
¢ Why are the ¢* terms so small? 
¢ How are the scalars in the condensate converted to more familiar particles? 


In the context of supersymmetry, there is a natural answer to each of these 
questions. First, as we have stressed, there are scalar fields carrying baryon and 
lepton number. As we will see, in the limit that supersymmetry is unbroken, there 
are typically directions in the field space in which the quartic terms in the potential 
vanish. Finally, the scalar quarks and leptons will be able to decay (in a baryon- 
and lepton-number conserving fashion) to ordinary quarks. 


19.6 Flat directions and baryogenesis 


To discuss the problem of baryon number generation, we first want to examine 
the theory in a limit in which we ignore the soft SUS Y-breaking terms. After all, 
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at very early times, H >> My, and these terms are irrelevant. We are now quite 
familiar with the fact that supersymmetric theories often exhibit flat directions. 
At the renormalizable level, the MSSM possesses many flat directions. A simple 


example is 
H, = i.) Li= ia): (19.98) 
Uv 0 


This direction is characterized by a modulus which carries lepton number. Writ- 
ten in a gauge-invariant fashion, ® = H,,L. As we have seen, producing a lep- 
ton number is for all intents and purposes like producing a baryon number. Non- 
renormalizable, higher-dimension terms, with more fields, can lift the flat direction. 
For example, the quartic term in the superpotential: 


1 
La= yg tek? (19.99) 


respects all of the gauge symmetries and is invariant under R-parity. It gives rise to 
a potential 


Vis. (19.100) 


There are many more flat directions, and many of these do carry baryon or 
lepton number. A flat direction with both baryon and lepton number excited is the 
following: 


First generation: Qj =b ith=a Ly2=b (19.101) 
Second: d; = V|b|? + |a|? (19.102) 
Third: d3 = a. (19.103) 


(On Q, the upper index is a color index, the lower index an SU(2) index, and we 
have suppressed the generation indices.) 

Higher-dimension operators again can lift this flat direction. In this case the 
leading term is: 


1 1927 lqyps1 72.73 
Cy = FAO L'a Pa"). (19.104) 


Here the superscripts denote flavor. We have suppressed color and SU (2) indices, 
but the braces indicate sets of fields which are contracted in SU(3) and SU(2) 
invariant ways. In addition to being completely gauge-invariant, this operator is 
invariant under ordinary R-parity. (There are lower-dimension operators, including 
operators of dimension 4, which violate R-parity). It gives rise to a term in the 
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plo 
Vit = Me: (19.105) 
Here ® refers in a generic way to the fields whose vevs parameterize the flat 
directions (a,b). 


19.7 Supersymmetry breaking in the early universe 


We have indicated that higher-dimension, supersymmetric operators give rise to 
potentials in the flat directions. To fully understand the behavior of the fields in the 
early universe, we need to consider supersymmetry breaking, which gives rise to 
additional potential terms. 

In the early universe, we expect supersymmetry is much more badly broken than 
it is in the present era. For example, during inflation, the non-zero energy density 
(cosmological constant) breaks supersymmetry. Suppose that / is the inflaton field, 
and that the inflaton potential arises because of a non-zero value of the auxiliary 
field for J, Fy = OW/OdT. So, during inflation, supersymmetry is broken by a large 
amount. Not surprisingly, as a result, there can be an appreciable supersymmetry- 
breaking potential for ®. These contributions to the potential have the form: 


Vu = H?®* f (®?/M,). (19.106) 


It is perfectly possible for the second derivative of the potential near the origin to 
be negative. In this case, writing our higher-dimension term as: 


1 
Ww, = “ae (19.107) 
the potential takes the form 
1 
V =—A? ||)? + re (19.108) 


The minimum of the potential then lies at: 


H n+1 
Py + M (7) : (19.109) 


More generally, one can see that the higher the dimension of the operator that raises 
the flat direction, the larger the starting value of the field — and the larger the ultimate 
value of the baryon number. Typically, there is plenty of time for the field to find 
its minimum during inflation. After inflation, H decreases, and the field ® evolves 
adiabatically, oscillating slowly about the local minimum for some time. 
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Our examples illustrate that in models with R-parity, the value of n, and hence the 
size of the initial field, can vary appreciably. Which flat direction is most important 
depends on the form of the mass matrix (i.e. on which directions the curvature of 
the potential is negative). With further symmetries, it is possible that n is larger, 
and even that all operators which might lift the flat direction are forbidden. For 
the rest of this section we will continue to assume that the flat directions are lifted 
by terms in the superpotential. If they are not, the required analysis is different, 
since the lifting of the flat direction is entirely associated with supersymmetry 
breaking. 


19.7.1 Appearance of the baryon number 


The term in the potential, |9W/2®|?, does not break either baryon number or CP. 
In most models, it turns out that the leading sources of B and CP violation come 
from supersymmetry-breaking terms associated with F’. These have the form 


am3/2W +bHW. (19.110) 


Here a and b are complex, dimensionless constants. The relative phase in these two 
terms, 5, violates CP. This is crucial; if the two terms carry the same phase, then 
the phase can be eliminated by a field redefinition, and we have to look elsewhere 
for possible CP-violating effects. Examining Eqs. (19.99) and (19.104), one sees 
that the term proportional to W violates B and/or L. In following the evolution 
of the field ©, the important era occurs when H ~ m3/2. At this point, the phase 
misalignment of the two terms, along with the B-violating coupling, leads to the 
appearance of a baryon number. From the equations of motion, the equation for the 
time rate of change of the baryon number is 
dng sin(d)m 3/2 


ae oo. (19.111) 


Assuming that the relevant time is H~', one is led to the estimate (supported by 
numerical studies) 


n — | sino? (19.112) 
B vo 0° * 


Here, po is determined by H © m3), Le. oe = m3.>M". 


19.8 The fate of the condensate 


Of course, we don’t live in a universe dominated by a coherent scalar field. In 
this section, we consider the fate of a homogeneous condensate, ignoring possible 
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inhomogeneities. The following section will deal with inhomogeneities, and the 
interesting array of phenomena to which they might give rise. 

We will adopt the following model for inflation. The features of this picture are 
true of many models of inflation, but by no means all. We will suppose that the 
energy scale of inflation is E ~ 10'> GeV. We assume that inflation is due to a 
field, the inflaton J. The amplitude of the inflaton, just after inflation, we will take 
to be of order M ~ 10!8 GeV (the usual reduced Planck mass). Correspondingly, 
we will take the mass of the inflaton to be m,; = 10! GeV (so that m;M; ee E*), 
Correspondingly, the Hubble constant during inflation is of order H; ~ E?/ M, © 
10!* GeV. 

After inflation ends, the inflaton oscillates about the minimum of its poten- 
tial, much like the field ®, until it decays. We will suppose that the inflaton 
couples to ordinary particles with a rate suppressed by a single power of the 
Planck mass. Dimensional analysis then gives for the rough value of the inflaton 
lifetime: 


ly; = — ~ 1GeV. (19.113) 


The reheating temperature can then be obtained by equating the energy density at 
time H ~ I'(o = 3H*M7) to the energy density of the final plasma: 


Ta = T(t =17') ~ (PyM,)'? ~ 10° GeV. (19.114) 


The decay of the inflaton is actually not sudden, but leads to a gradual reheating 
of the universe, as described, for example, in the book by Kolb and Turner (1990). 
As a function of time (#): 


1/4 


T © (TgH(t)M,) (19.115) 


As for the field ®, our basic assumption is that during inflation, it obtains a large 
value, in accord with Eq. (19.109). When inflation ends, the inflaton, by assumption, 
still dominates the energy density for a time, oscillating about its minimum; the 
universe is matter dominated during this period. The field ® now oscillates about 
a time-dependent minimum, given by Eq. (19.109). The minimum decreases in 
value with time, dropping to zero when H ~ m3/2. During this evolution, a baryon 
number develops classically. This number is frozen once H ~ m3/9. 

Eventually the condensate will decay, through a variety of processes. As we have 
stressed, the condensate can be thought of as a coherent state of ® particles. These 
particles — linear combinations of the squark and slepton fields — are unstable and 
will decay. However, for H < m3/p, the lifetimes of these particles are much longer 
than in the absence of the condensate. The reason is that the fields to which ® 
couples have mass of order ®, and © is large. In most cases, the most important 


19.8 The fate of the condensate 299 


process which destroys the condensate is what we might call evaporation: particles 
in the ambient thermal bath can scatter off of the particles in the condensate, leaving 
final states with only ordinary particles. 

We can make a crude estimate for the reaction rate as follows. Because the 
particles which couple directly to ® are heavy, interactions of ® with light particles 
must involve loops. So we include a loop factor in the amplitude, of order a5, the 
weak coupling squared. Because of the large masses, the amplitude is suppressed 
by ®. Squaring, and multiplying by the thermal density of scattered particles, 
gives: 

Tp ~ ain = (THM) (19.116) 
The condensate will evaporate when this quantity is of order H. Since we know the 
time dependence of ®, this allows us to solve for this time. One finds that equality 
occurs, in the casen = 1, for H; ~ 107-10? GeV. Forn > 1, it occurs significantly 
later (for n < 4, it occurs before the decay of the inflaton; for n > 4, a slightly 
different analysis is required than that which follows). In other words, for the case 
n = 1, the condensate evaporates shortly after the baryon number is created, but 
for larger n, it evaporates significantly later. 

The expansion of the universe is unaffected by the condensate as long as the 
energy density in the condensate, po ~ m2”, is much smaller than that of the 
inflaton, p; ~ H*M?. Assuming that me ~ m3 j2 ~ 0.1-1 TeV, a typical super- 
symmetry breaking scale, one can estimate the ratio of the two densities at the time 
when H ~ m3/2 as 


2/(n+1) 
wae (=) ; (19.117) 
PI My 


We are now in a position to calculate the baryon to photon ratio in this model. 
Given our estimate of the inflaton lifetime, the coherent motion of the inflaton still 
dominates the energy density when the condensate evaporates. The baryon number 
is just the ® density just before evaporation divided by the ® mass (assumed of 
order m3 /2), while the inflaton number is p;/M7. So the baryon to inflaton ratio 
follows from Eq. (19.117). With the assumption that the inflaton energy density is 
converted to radiation at the reheating temperature, Tp, we obtain: 


(n-1) 


T Ty M. (n+l) 
nm tp BIR Po 10" ( : R ) ( p ) (19.118) 
ny (p1/TR) Neme pr 10° GeV J \m3/2 


Clearly the precise result depends on factors beyond those indicated here explic- 
itly, such as the precise mass of the ® particle(s). But as a rough estimate, it is rather 
robust. For n = 1, it is in precisely the right range to explain the observed baryon 
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asymmetry. For larger n, it can be significantly larger. In light of our discussion of 
late decays of moduli this is potentially quite interesting. These decays produce a 
huge amount of entropy, typically increasing the entropy by a factor of 10’ or so. 
The baryon density is diluted by a corresponding factor. But we see that coherent 
production can readily yield baryon to photon densities, prior to moduli decay, of 
the needed size. 

There are many other issues which can be studied, both in leptogenesis and in 
Affleck—Dine baryogenesis, but it appears that both types of process might well 
account for the observed baryon asymmetry. The discovery (or not) of low-energy 
supersymmetry, and further studies of neutrino masses, might make one or the other 
picture more persuasive. Both pose challenges, as they involve couplings which we 
are not likely to measure directly. 


19.9 Dark energy 


It has long been recognized that any cosmological constant in nature is far smaller 
than scales of particle physics. Before the discovery of the dark energy, many physi- 
cists conjectured that for some reason of principle, this energy is zero. However, it is 
now clear that the energy is non-zero, and in fact that this dark energy is the largest 
component of the energy density of the universe. Present data is compatible with 
the idea that this energy density represents a cosmological constant (w = —1), 
but other suggestions, typically involving time-varying scalar fields, have been 
offered. 

Apart from its smallness, another puzzle surrounding the cosmological constant 
is simply one of coincidence: why is the dark energy density today comparable to 
the dark matter density? Weinberg has argued that it couldn’t be much different 
than this in a universe containing stars and galaxies, provided all of the other laws 
of nature are as we observe. The basic point is that if the dark energy were, say, 
10° times more dense than we observe, it would have come to dominate the energy 
density when the universe was much younger than it is today — prior to the formation 
of galaxies and stars. The rapid acceleration after that time would have prevented 
the formation of structure. More refined versions of the argument give estimates 
for the dark energy within a factor of ten of the measured value. 

Weinberg speculated that perhaps the universe is much larger than we see (our 
current horizon). In other regions, it has different values of the cosmological 
constant. Only in those regions where A is very small would stars — and hence 
observers — form. Weinberg called this possible explanation (actually prediction) 
of A the weak anthropic argument. We will return to this question in our studies 
of string theory, where we will see that such a landscape of ground states may 
exist. 
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Suggested reading 


Seminal papers on inflation include that of Guth (1981), which proposed a version 
of inflation now often referred to as “old inflation,” and those of Linde (1982) and 
Albrecht and Steinhardt (1982), which contain the germ of the slow roll inflation 
idea stressed in this work. The ideas of hybrid inflation were developed by Linde 
(1994); those specifically discussed here were introduced by Randall et al. (1996) 
and Berkooz et al. (2004). There are a number of good texts on inflation and related 
issues, some of which we have mentioned in the previous chapter. These include 
those of Dodelson (2004), Kolb and Turner (1990), and Linde (1990). Dodelson 
provides a particularly up to date discussion of dark matter, including more de- 
tailed calculations than those presented here, and dark energy, including surveys 
of observational results. For a review of axions and their cosmology and astro- 
physics, see Turner (1990). For more recent papers which raise questions about the 
cosmological axion limits, see, for example, Banks et al. (2003). The cosmolog- 
ical moduli problem, and possible solutions, were first discussed by Banks et al. 
(1994) and de Carlos et al. (1993). A general review of electroweak baryogenesis, 
including detailed discussions of phenomena at the bubble walls, appears in Cohen 
et al. (1993). A discussion of electroweak baryogenesis within the MSSM appears 
in Carena et al. (2003). A detailed review of baryogenesis appears in Buchmuller 
et al. (2005), while Enqvist and Mazumdar (2003) focuses on Affleck—Dine baryo- 
genesis. A more comprehensive review of baryogenesis mechanisms appears in 
Dine and Kusenko (2003). Aspects of the cosmological constant, and especially 
Weinberg’s anthropic prediction of A, are explained clearly in Weinberg (1989), 
with more recent additions in Vilenkin (1995) and Weinberg (2000). 


Exercises 


(1) Verify the slow roll conditions, Eqs. (19.12) and (19.13). Determine the number of 
e-foldings and the size of 50/p as a function of NV. 

(2) Work through the limits on the axion in more detail. Try to analyze the behavior of the 
axion energy in the high-temperature regime. 

(3) Construct a discrete R-symmetry which guarantees that the Hy L flat direction is exactly 
flat. Assuming that the universe reheats to 100 MeV when a modulus decays, estimate 
the final baryon number of the universe in this case. 

(4) Suppose that the characteristic scale of supersymmetry breaking is much higher than 1 
TeV, say 10° GeV. Discuss baryogenesis by coherent scalar fields in such a situation. 


Part 3 
String theory 


20 


Introduction 


String theory was stumbled on, more or less, by accident. In the late 1960s, string 
theories were first proposed as theories of the strong interactions. But, it was quickly 
realized that, while hadronic physics has a number of string-like features, string 
theories were not suitable for a detailed description. In their simplest form, string 
theories had massless spin-two particles and more than four dimensions of space- 
time, hardly features of the strong interactions. But a small group of theorists 
appreciated that the presence of a spin-two particle implied that these theories were 
generally covariant and explored them through the 1970s and and early 1980s as 
possible theories of quantum gravity. Like field theories, the number of possible 
string theories seemed to be infinite, while, unlike field theories, there was reason to 
believe that these theories did not suffer from ultraviolet divergences. In the 1980s, 
however, studies of anomalies in higher dimensions suggested that all string theories 
with chiral fermions and gauge interactions suffered from quantum anomalies. But 
in 1984, it was shown that anomalies cancel for two choices of gauge group. It was 
quickly recognized that the non-anomalous string theories do come close to unifying 
gravity and the Standard Model of particle physics. Many questions remained. 
Beginning in 1995, great progress was made in understanding the deeper structure 
of these theories. All of the known string theories were understood to be different 
limits of some larger structure. As string theories still provide the only framework 
in which one can do systematic computations of quantum gravity effects, many 
workers use the term “string theory” to refer to some underlying structure which 
unifies quantum mechanics, gravity and gauge interactions. 

String theory has provided us with many insights into what a fundamental the- 
ory of gravity and gauge interactions might look like, but there is still much we 
do not understand. We can’t begin a course by enunciating some great principle 
and seeing what follows. We might, for example, have imagined that the under- 
lying theory would be a string field theory, whose basic objects would be objects 
which would create and annihilate strings. Some set of organizing principles would 
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determine the action for this system, and the rest would be a problem of work- 
ing out the consequences. But there are good reasons to believe that string theory 
is not like this. Instead, we can at best provide a collection of facts, organized 
according to the teacher/author/professor’s view of the subject at any given mo- 
ment. As a result, it is perhaps useful first to give at least some historical per- 
spective as to how these facts were accumulated, if only to show that there are, 
as of yet, no canonical texts or sacred principles in the subject. In the next sec- 
tion, we review a bit of the remarkable history of string theory. In the following 
section, we will attempt to survey what is known as of this writing: the various 
string theories, with their spectra and interactions, and the connections between 
them. 


20.1 The peculiar history of string theory 


For electrodynamics, the passage from classical to quantum mechanics is reason- 
ably straightforward. But general relativity and quantum mechanics seem funda- 
mentally incompatible. Viewed as a quantum field theory, Einstein’s theory of 
general relativity is a non-renormalizable theory; its coupling constant (in four di- 
mensions) has dimensions of inverse mass-squared. As aresult, quantum corrections 
are very divergent. From the point of view developed in Part 1, these divergences 
should be thought of as cut off at some scale associated with new physics; general 
relativity is an incomplete theory. Hawking has discussed another sense in which 
gravity and quantum mechanics seem to clash. Hawking’s paradox appears to be 
associated with phenomena at arbitrarily large distances — in particular, with the 
event horizons of large black holes. Because black holes emit a thermal spectrum 
of radiation, it seems possible for a pure state — a large black hole — to evolve 
into a mixed state. These puzzles suggest that reconciling quantum mechanics and 
gravity will require a radical rethinking of our understanding of very short-distance 
physics. 

Apart from its potential to reconcile quantum mechanics and general relativity, 
there is another reason that string theory has attracted so much attention: it is finite, 
free of the ultraviolet divergences that plague ordinary quantum field theories. In the 
previous chapters of this book, we have adopted the point of view that our theories of 
nature should be viewed as effective theories. Itis not clear that they can be complete 
in any sense. One might wonder whether some sort of structure exists where the 
process stops; where some finite, fundamental theory accounts for the features of 
our present, more tentative constructions. Many physicists have speculated through 
the years that these two questions are related; that an understanding of quantum 
general relativity would provide a fundamental length scale. The finiteness of string 
theory suggests it might play this role. 
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String theory was discovered, by accident, in the 1960s, as physicists tried to 
understand certain regularities of the hadronic S-matrix. In particular, hadronic 
scattering amplitudes exhibited a feature then referred to as “duality.” Scattering 
amplitudes with two incoming and two outgoing particles (so-called 2 > 2 pro- 
cesses) could be described equally well by an exchange of mesons in the s channel 
or in the ¢ channel (but not both simultaneously). This is not a property, at least per- 
turbatively, of conventional quantum field theories. Veneziano succeeded in writing 
an expression for an S-matrix with just the required properties. Veneziano’s result 
was extended in a variety of ways and it was soon recognized (by Nambu, Susskind 
and others) that this model was equivalent to a theory of strings. 

One could well imagine coming to string theory by a different route. Quantum 
field theory describes point particles. Apart from properties like mass and charge, 
no additional features (size, shape) are assigned to the basic entities. One could 
well imagine that this is naive, but in describing nature, quantum field theory is 
extraordinarily successful. In fact, there is no evidence for any size of the electron 
or the quarks down to distances of order 10~'’ cm (energy scales of order several 
TeV). Still, it is natural to try to go beyond the idea of particles as points. The 
simplest possibility is to consider entities with a one-dimensional extent, strings. 
In the next chapters, we will discuss the features of theories of string. Here we just 
note that a straightforward analysis yields some remarkable results. A relativistic 
quantum string theory is necessarily: 


(1) a theory of general relativity; 

(2) a theory with gauge interactions; 

(3) finite: string world sheets are smooth. Strings do not interact at space-time points. As 
a result, in perturbation theory, one does not have the usual ultraviolet divergences of 
quantum theories of relativistic particles. 


These features are not postulated; they are inevitable. Other, seemingly less de- 
sirable, features also emerge: the space-time dimension has to be 26 or 10. Many 
string theories also contain tachyons in their spectrum, whose interpretation is not 
immediately clear. 

As theories of hadronic physics, string theories had only limited success. Their 
spectra and S-matrices did share some features in common with those of the real 
strong interactions. But as a result of the features described above — massless 
particles and unphysical space-time dimensions as well as the presence of tachyons 
in many cases — strings were quickly eclipsed by QCD as a theory of the strong 
interactions. 

Despite these setbacks, string theory remained an intriguing topic. String theories 
were recognized to have short distance behavior much different — and better — than 
that of quantum field theories. There was reason to think that such theories were free 
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of ultraviolet divergences altogether. Scherk and Schwarz, and also Yoneya, made 
the bold proposal that string theories might well be sensible theories of quantum 
gravity. At the time, any concrete realization of this suggestion seemed to face enor- 
mous hurdles. The first string theories contained bosons only. But string theories 
with fermions were soon studied, and were discovered to have another remarkable, 
and until then totally unfamiliar, property: supersymmetry. We have already learned 
a great deal about supersymmetry, but at this early stage, its possible role in nature 
was completely unclear. In their early formulations, string theories only made sense 
in special — and at first sight uninteresting — space-time dimensions. But it had been 
conjectured since the work of Kaluza and Klein that higher-dimensional space- 
times might be “compactified,” leaving theories which appear four-dimensional, 
and Scherk and Schwarz hypothesized that this might be the case for string theo- 
ries. Over a decade, Green and Schwarz studied the supersymmetric string theories 
further, developing a set of calculational tools in which supersymmetry was mani- 
fest, and which were suitable for tree-level and one-loop computations. Witten and 
Alvarez-Gaume, however, pointed out that higher-dimensional theories in general 
suffer from anomalies, which render them inconsistent. They argued that almost 
all of the then-known chiral string theories suffered from just such anomalies. It 
appeared that the string program was doomed; only two known string theories, 
theories without gauge interactions, seemed to make sense. Green and Schwarz, 
however, persisted. By a direct string computation, they discovered that, while it 
was true that almost all would-be string theories with gauge symmetries are in- 
consistent, there was one exception among the then-known theories, with a gauge 
group O(32). They reviewed the standard anomaly analysis and realized why O(32) 
is special; this work raised the possibility that there might be one more consistent 
string theory, based on the gauge group Eg x Es. The corresponding string theory, 
as well as another with gauge group O(32) (known as the heterotic string theories), 
was promptly constructed. 

This work stimulated widespread interest in string theory as a unified theory 
of all interactions, for now these theories appeared to be not only finite theories 
of gravity, but also nearly unique. Compactification of the heterotic string on six- 
dimensional manifolds known as Calabi—Yau spaces were quickly shown to lead to 
theories which at low energies closely resemble the Standard Model, with similar 
gauge groups, particle content, and other features such as repetitive generations, 
low-energy supersymmetry and dynamical supersymmetry breaking. The various 
string theories have since been shown to be part of a larger theory, suggesting that 
one is studying some unique structure which describes quantum gravity. Some of 
the basic questions about quantum gravity theories, such as Hawking’s puzzle, have 
been, at least partially, resolved. 
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Many questions remain, however. There is still no detailed understanding of how 
string theory can make contact with experiment. There are a number of reasons for 
this. String theory, as we will see, is a theory with no dimensionless parameters. 
This is a promising beginning for a possible unified theory. But it is not clear how a 
small expansion parameter can actually emerge, allowing systematic computation. 
String theory provides no simple resolution of the cosmological constant puzzle. 
Finally, while there are solutions which resemble nature, there are vastly more 
which don’t. A principle, or dynamics, which might explain the selection of one 
vacuum or another, has not emerged. 

Yet string theory is the only model we have for a quantum theory of gravity. 
More than that, it is the only model we have for a finite theory, which could be 
viewed as some sort of ultimate theory. At the same time, string theory addresses 
almost all of the deficiencies we have seen in the Standard Model, and has the 
potential to encompass all of the solutions we have proposed. The following are 
some examples. 


(1) The theory unifies gravity and gauge interactions in a consistent, quantum mechanical 
framework. 

(2) The theory is completely finite. It has no free parameters. The constants of nature must 
be determined by the dynamics, or other features internal to the theory. 

(3) The theory possesses solutions in which space-time is four-dimensional, with gauge 
groups close to the Standard Model and repetitive generations. It is in principle possible 
to compute the parameters of the Standard Model. 

(4) Many of the solutions exhibit low-energy supersymmetry, of the sort we have considered 
in the first part of this book. 

(5) Other solutions exhibit large dimensions, technicolor-like structures, and the like. 

(6) The theory does not have continuous global symmetries, but often possesses discrete 
symmetries, of the sorts we have considered. 


While these are certainly encouraging signs, as we will learn in the third part of 
this book, we are a long way from a detailed understanding of how string theory 
might describe nature. We will see that there are fundamental obstacles to such an 
understanding. At the same time, we will see that string theory provides a useful 
framework in which to assess proposals for beyond the Standard Model physics. 

The third part of this book is intended to provide the reader with an overview 
of superstring theory, with a view to connecting string theory with nature. In the 
next chapter, we will study the bosonic string. We will understand how to find the 
spectra of string theories. We will also understand string interactions. The reason 
that string theories are so constrained is that strings can only interact in a limited 
set of ways, essentially by splitting and joining. We will explain how to translate 
this into concrete computations of scattering amplitudes. 
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In subsequent chapters, we will turn to the superstring theories, obtaining their 
spectra and understanding their interactions. We will then turn to compactification 
of string theories, focussing mainly on compactifications to four dimensions. We 
first consider toroidal compactifications of strings, whose features can be worked 
out quite explicitly. We also discuss orbifolds, simple string models which can 
exhibit varying amounts of supersymmetry. Then we devote a great deal of attention 
to compactifications on Calabi-Yau spaces. These are smooth spaces; superstring 
theories compactified on these spaces exhibit varying amounts of supersymmetry. 
Many look quite close to the real world. 

Finally, we will turn to the question of developing a realistic string phenomenol- 
ogy. Having seen the many intriguing features of string models, we will point out 
some of the challenges. These are as follows. 


(1) The proliferation of classes of string vacua. 

(2) Within different classes, the existence of moduli. 

(3) Mechanisms which generate potentials for moduli are known, but in regimes where 
calculations can be performed systematically, tend not to produce stable minima. The 
question of supersymmetry breaking is closely related to the question of stabilizing 
moduli. 

(4) There are detailed issues, such as proton decay, features of quark and lepton masses, 
and many others. 


We will touch on some proposed solutions to these puzzles. Much string model 
building simply posits that moduli have been fixed in some way, and a vacuum 
with desirable properties is somehow selected by some (unknown) overarching 
principle. This is often backed up by calculations, which, while not systematic, are 
at least suggestive that moduli are stabilized. An alternative viewpoint is provided 
by the “landscape.” Here, one starts with the observation that introducing fluxes for 
various tensor fields can potentially stabilize moduli. The possible choices of flux 
vastly increase the possible array of (metastable) string ground states. If one simply 
accepts that there is such a landscape of states, and that the universe samples many 
of these states in some way, then one is led to think about distributions of parameters 
of low-energy physics, not merely the coupling constants, but the gauge groups, 
particle content, scale of supersymmetry breaking, and value of the cosmological 
constant. For better or worse, this is in some sense the ultimate realization of the 
notions of naturalness which so concerned us in Part 1. The question is why we are 
the likely outcome of a distribution of this sort. We will leave it for the readers — 
and for experiment — to sort out which, if any, of these viewpoints may be correct. 

This is not a string theory textbook. The reader will not emerge from these few 
chapters with the level of technical proficiency in weakly coupled strings provided 
by Polchinski’s text, or with the expertise in Calabi-Yau spaces provided by the 
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book of Green, Schwarz and Witten (1987). In order to quickly obtain the spectra 
of various string theories, the following chapters heavily emphasize light cone 
techniques. While some aspects of the covariant treatment are developed in order 
to explain the rules for computing the S-matrix, many important topics, especially 
the Polyakov path integral approach and BRST quantization, are given only cursory 
treatment. Similarly, the introduction to D-brane physics provides some basic tools, 
but does not touch on much of the well-developed machinery of the subject. The 
reader who wishes a more thorough grounding in the physics of D-branes will want 
to consult the texts of Polchinski (1998) and of Johnson (2003). 


Suggested reading 


The introduction of the book by Green et al. (1987) provides a particularly good 
overview of the history of string theory, and some of its basic structure. The intro- 
ductory chapter of Polchinski’s text (1998) provides a good introduction to more 
recent developments, and a perspective on why strings might be important in the 
description of nature. 


21 


The bosonic string 


Consider a particle moving through space. As it moves, it sweeps out a path called 
a world line. The action of the particle is just the integral of the invariant length 
element along the path, up to a constant. 

Suppose we want to describe the motion of a string. A string, as it moves, 
sweeps out a two-dimensional surface in space-time called a world sheet. We can 
parameterize the path in terms of two coordinates, one time-like and one space-like, 
denoted o and T, or og and o;. The action should not depend on the coordinates 
we use to parameterize the surface. Polyakov stressed that this can be achieved by 
using the formalism of general relativity. Introduce a two-dimensional metric, yog. 
Then an invariant action is: 


T 
Ss 5 | PoP eX" 36 Xt (21.1) 
Here our conventions are such that for a flat space, 
—-1 0 
(similarly, our D-dimensional space-time metric is ds* = —dt? + dx). 


This action has a large symmetry group. There are, first, general coordinate 
transformations of the two-dimensional surface. For a simple topology (plane or 
sphere), these permit us to bring the metric to the form: 


y =e?n. (21.3) 
In this gauge (the conformal gauge) the action is independent of ¢: 
—T 
S= a i dan”? dy X" Ag X" Nu. (21.4) 


It is possible to fix this symmetry further. To motivate this gauge choice, we 
consider an analogous problem in field theory. In a gauge theory like QE D, we can 
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fix a covariant gauge, 0 - A = 0. This gauge fixing, while manifestly Lorentz invari- 
ant, is not manifestly unitary. We might try to quantize covariantly by introducing 
creation and annihilation operators, a”. These would obey: 


[a“,at’] = g” (21.5) 


so that some states seem to have negative norm. If one proceeds in this way, it is 
necessary to prove that states with negative (or vanishing) norm can’t be produced 
in scattering amplitudes. 

One way to deal with this is to choose a non-covariant gauge. Coulomb gauge is a 
familiar example, but a particularly useful description of gauge theories is obtained 
by choosing the “light cone gauge.” First, define light cone coordinates: 


= piste Po, (21.6) 


The remaining, transverse coordinates, we will simply denote as X. Correspond- 


ingly, one defines the light cone momenta: 
1 


p* RP ep es (21.7) 


x2 


Note that 
A-B=—(AtB” +A-Bt)+A-B. (21.8) 


Now we will think of x* as our time variable. The “Hamiltonian” generates trans- 
lations in x*; it is in fact p~. Note that for a particle: 


p’=—2ptp +p’. (21.9) 
The Hamiltonian is: 
1 
H= aa (21.10) 
P 


Having made this choice of variables, one can then make the gauge choice At = 0. 
In the Lagrangian, there are no terms involving 0,A~, so A~ is not a dynamical 
field; only the D — 2 A's are dynamical. So we have the correct number of physical 
degrees of freedom. One simply solves for A~ by its equations of motion. In the 
early days of QCD, this description proved very useful in understanding very-high- 
energy scattering. In practice, similar algebraic gauges are still very useful. 

Light cone coordinates, more generally, are very helpful for identifying physical 
degrees of freedom. Consider the problem of counting the degrees of freedom asso- 
ciated with some tensor field, A“”?. For a massive field, one counts by going to the 
rest frame, and restricting the indices jz, v, o to be D — 1 dimensional. For a mass- 
less field, the relevant group is the “little group” of the Lorentz group, SO(D — 2). 
Correspondingly, one restricts the indices to be D — 2 dimensional. So, for example, 
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for a massless vector, there are D — 2 degrees of freedom. For asymmetric, traceless 
tensor (the graviton), there are [(D — 2)(D — 1)/2] — 1. Light cone coordinates, 
and light cone gauge, provide an immediate realization of this counting. 

For many questions in quantum field theory, covariant methods are much more 
powerful than the light cone. Quantum field theorists are familiar with techniques 
for coping with covariant gauges. These involve introduction of additional fictitious 
degrees of freedom (Faddeev—Popov ghosts). It is probably fair to say that most do 
not know much about gauges like the light cone gauge (there is almost no treatment 
of these topics in standard texts). But we will see in string theory that the light cone 
gauge is quite useful in isolating the physical degrees of freedom of strings. It lacks 
some of the elegance of covariant treatments, but it avoids the need to introduce an 
intricate ghost structure, and, as in field theory, the physical degrees of freedom are 
manifest. The differences between the covariant and light cone treatments, as we 
will see, are most dramatic when we consider supersymmetric strings. In the light 
cone approach of Green and Schwarz, space-time supersymmetry is manifest. In 
the covariant approach, it is not at all apparent. On the other hand, for the discussion 
of interactions, the light cone treatment tends to be rather awkward. In this chapter, 
we will first introduce the light cone gauge, and then go on to discuss aspects of 
the covariant formulation. The suggested readings will satiate the reader interested 
in more details of the covariant treatment. 


21.1 The light cone gauge in string theory 
21.1.1 Open strings 


In the conformal gauge, we can use our coordinate freedom to choose X* = t. We 
also can choose the coordinates such that the momentum density, P*, is constant 
on the string. In this gauge, in D dimensions, the independent degrees of freedom 


of a single string are the coordinates, X'(o,t), 1 =1,..., D—2. They are each 
described by the Lagrangian of a free two-dimensional field: 
T 
S= 5 | Pow.x!? SOX ys (21.11) 


It is customary to define another quantity, a’ (the “Regge slope”), with dimensions 
of length-squared: 
a’ = cis 
2nT 
We will generally take a step further and use units with a’ = 1/2. In this case, the 
action is: 


(21.12) 


S= = i d’o((a,X')* — (8, X")). (21.13) 
Qn 
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The reader should be alerted that there is another common choice of units: a’ = 2, 
and we will encounter this later. In this case, the action has a 1/8z out front. 

In order to write equations of motion, we need to specify boundary conditions in 
o. Consider, first, open strings, i.e. strings with two free ends. We want to choose 
boundary conditions so that when we vary the action we can ignore surface terms. 
There are two possible choices. 


(1) Neumann boundary conditions, 
0, X'(t, 0) = 0,X'(t, 1) = 0. (21.14) 

(2) Dirichlet boundary conditions: 
X!(r, 0) = X/(t, x) = constant. (21.15) 


It is tempting to discard the second possibility, as it appears to violate translation 
invariance. For now, we consider only Neumann boundary conditions, but we will 
return later to Dirichlet. 

We want to write a Fourier expansion for the X/s. The normalization of the 
coefficients is conventionally taken to be somewhat different than that of relativistic 
quantum field theories: 


1 ; 
X' =x! 4 p'tH+i ae —ale-" cos(no). (21.16) 
n#0 nt 
The qs are, up to constants, ordinary creation and annihilation operators: 
of = Wnag .@) = /nal. (21.17) 


Because we are working at finite volume (in the two-dimensional sense) there are 
normalizable zero modes, the x/s and p/s. They correspond to the coordinate and 
momentum of the center of mass of the string. From our experience in field theory, 
we know how to quantize this system. We impose the commutation relations: 


[8,X/(o, t), X’(o’, tT] = “sg =o"); (21.18) 
a 
This is satisfied by: 
esp baie” la) .e7) = nds oo (21.19) 


The states of this theory can be labeled by their transverse momenta, p, and by 
integers corresponding to the occupation numbers of the infinite set of oscillator 
modes. It is helpful to keep in mind that this is just the quantization of a set of free, 
two-dimensional fields in a finite volume. 
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We can write a Hamiltonian for this system. Normal ordering: 
H=p’?+N+a (21.20) 
where 
[oe 

NaS oa, (21.21) 
and a is a normal ordering constant. States can be labelled by the occupation 

numbers for each mode, N,,, and the momentum, oe 
Ip’, {Nn,;}) (21.22) 


The light cone Hamiltonian, H, generates translations in t. It is convenient to 
refine the gauge choice: 


XP pe, 
Since p™ is conjugate to the light cone time, x*, 
p =H/p", (21.23) 
or 
pip =p+N+a, M?=N+a. (21.24) 


So the quantum string describes a tower of states, of arbitrarily large mass. The 
constant a is not arbitrary. We will see shortly that 


a=-l. (21.25) 
This means that the lowest state is a tachyon. We can label this state simply 


|T(p)) = |p, {0}) = |p). (21.26) 


The state carries transverse momentum p and longitudinal momenta p* and p~, and 
is annihilated by the infinite tower of oscillators. The significance of this instability 
is not immediately clear; we will close our eyes to it for now and proceed to look 
at other states in the spectrum. When we study the superstring, we will often find 
that there are no tachyons. 

The first excited state is the state 


|A’) =a! |p). (21.27) 
Its mass is 


m,=1+a. (21.28) 
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Now we can see why a = —1. Here, A is a vector field with D — 2 components. 
In D dimensions, a massive vector field has D — 1 degrees of freedom; a massless 
vector has D — 2 degrees of freedom. So A must be massless, and a = 1 if the 
theory is Lorentz invariant. Later, we will give a fancier argument for the value of 
a, but the content is equivalent. 

At level 2, we have a number of states: 


a! 4|p) a! a7, |p). (21.29) 
These include a vector, a scalar, and a symmetric tensor. We won’t attempt here to 
group them into representations of the Lorentz group. 

It turns out that the value of D is fixed, D = 26. In the light cone formulation, the 
issue is that the light cone theory is not manifestly Lorentz invariant. To establish 
that the theory is Poincaré invariant, it is necessary to construct the full set of 
Lorentz generators and carefully check their commutators. This analysis yields the 
conditions D = 26 and a = —1. We will discuss the derivation of this result a bit 
more later. In a manifestly covariant formulation, such as the conformal gauge, the 
issue is one of unitarity, as in gauge field theories. Decoupling of negative and zero 
norm states yields, again, the condition D = 26. 

Turning to the gauge boson, it is natural to ask: what are the fields charged 
under the gauge symmetry. The answer is suggested by a picture of a meson as a 
quark and antiquark connected by a string. We can allow the ends of the strings 
to carry various types of charges. In the case of the bosonic string, these can be, 
for example, a fundamental and anti-fundamental of SU(N). Then the string itself 
transforms as a tensor product of vector representations. Because the open strings 
include massless gauge bosons, this product must lie in the adjoint representation 
of the group. In the bosonic string theory, one can also have SO(N) and Sp(N) 
groups. In the case of the superstring, we will see that the group structure is highly 
restricted. The theory will make sense only in flat ten dimensions, and then only if 
the group is O(32). 


21.2 Closed strings 


We have begun with open strings, since these are in some ways simplest, but theories 
of open strings by themselves are incomplete. There are always processes which 
will produce closed strings. For closed strings, we again have a set of fields X/(o, T). 
Their action is identical to that we wrote before. But they now obey the boundary 
conditions: 


X!(o +2, T) = X'(o, 7). (21.30) 
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Again, we can write a mode expansion: 
i 1 
x! _ x! Ir ee = aie zint—2) ale 2intt+o) . 21.31 
+ pit +5) (a + &, ) (21.31) 


The exponential terms are the familiar solutions to the two-dimensional wave equa- 
tion. One can speak of modes moving to the left (“left movers”) and to the right 
(“right movers’) on the string. Again we have commutation relations: 


eps it™ [e, cot Sndiayd” + [Oat ndeigd'"s (21,32) 


n? 


Now the Hamiltonian is: 


H=p’+N+N+b (21.33) 
where 
[oe a Ce 
Mea eal NS aa! (21.34) 
n=1 n=l 


In working out the spectrum, there is an important constraint. There should be 
no special point on the string, i.e. translations in the o direction should leave states 
alone. The generator of constant shifts of o can be found by the Noether procedure: 


P= [ 4oa.x'a,x" =N—-N. (21.35) 


So we need to impose the condition N = N on the states. 
Once more, the lowest state is a scalar, 


IT) =|p) mp=b. (21.36) 
Because of the constraint, the first excited state is: 
|W77) = a! oe’ |p). (21.37) 


We can immediately decompose these states into irreducible representations of 
the little group; there is a symmetric traceless tensor; a scalar (the trace); and an 
antisymmetric tensor. A symmetric, traceless tensor should have, if massive, poe 
D — | states. Here, however, we have only D? — 3D +1 states. This is precisely 
the correct number of states for a massless, spin-two particle — a graviton. The 
remaining states are precisely the number for a massless antisymmetric tensor field 
and a scalar. So we learn that b = —2. 

This is a remarkable result. General arguments, going back to Feynman, 
Weinberg and others, show that a massless spin-two particle, in a relativistic theory, 
necessarily couples like a graviton in Einstein’s theory. So string theory is a theory 
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of general relativity. This bosonic string is clearly unrealistic. But the presence of 
the graviton will be a feature of all string theories, including the more realistic ones. 


21.3 String interactions 


The light cone formulation is very useful for determining the spectrum of string 
theories, but it is somewhat more awkward for the discussion of interactions. As 
explained in the introduction, string interactions are determined geometrically, by 
the nature of the string world sheet. Actually turning drawings of world sheets 
into a practical computational method is surprisingly straightforward. This is most 
easily done using the conformal symmetry of the string theory. So we return to the 
conformal gauge. There are close similarities between the treatment of both open 
and closed strings. We will start with the treatment of closed strings, for which the 
Green functions are somewhat simpler. At the end of this chapter we will return to 
open strings. 


21.3.1 String theory in conformal gauge 


In conformal gauge, the action is: 


= ~ / d°o((a,X") — (8, X")’). (21.38) 
Introducing the two-dimensional light cone coordinates 
04 =o £0; (21.39) 
the flat world sheet metric takes the form 
N= =~ = =5 (21.40) 
and the action can be written: 
S= = | dos da_dg,X"d,_X". (21.41) 


At the classical level, this action is invariant under a conformal rescaling of the 
coordinates. If we introduce light cone coordinates on the world sheet, then the 
action is invariant under the transformations: 


Ce flan): (21.42) 


Later, we will Wick rotate, and work with complex coordinates; these conformal 
transformations will then be the conformal transformations familiar in complex 
variable theory. It is well known that by a conformal transformation one can map 
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the plane into a sphere, for example. In this case, the regions at infinity with incoming 
or outgoing strings are mapped to points. The creation or destruction of strings at 
these points is described by local operators in the two-dimensional, world sheet 
theory. In order to respect the conformal symmetry, these operators must, like the 
action, be integrals over the world sheet of local dimension-two operators. These 
operators are known as vertex operators, V(o, T). 

In conformal gauge, the action also contains Faddeev—Popov ghost terms, asso- 
ciated with fixing the world sheet general coordinate invariance. We will discuss 
some of their features later. But let’s focus on the fields X“ first. If we simply write 
mode expansions for the fields (taking closed strings, for definiteness) 


1 . 
X# =x" + ptr tid” —(ahe 9) + Ge Aero) (21.43) 


n#0 n 
we will encounter difficulties. The ws will now obey commutation relations: 
[x", p’] =ig*” Rea ap | = Nbpin gr”. (21.44) 


If we proceed naively, the minus sign means from g that we will have states in 
the spectrum of negative or zero norm. 

The appearance of negative norm states is familiar in gauge field theory. The res- 
olution of the problem, there, is gauge invariance. One can either choose a gauge in 
which there are no states with negative norm, or one can work in a covariant gauge 
in which the negative norm states are projected out. In a modern language, this pro- 
jection is implemented by the BRST procedure. But it is not hard to check that in a 
covariant gauge, low-order diagrams in QED, for example, give vanishing ampli- 
tudes to produce negative or zero norm states (photons with time-like or light-like 
polarization vectors). In gauge theories, it is precisely the gauge symmetry which 
accounts for this. In string theory, it is another symmetry, the residual conformal 
symmetry of the conformal gauge. 

In the chapter on general relativity, we learned that differentiation of the matter 
action with respect to the metric gives the energy-momentum tensor. In Einstein’s 
theory, differentiating the Einstein term as well gives Einstein’s equations. In the 
string case, the world sheet metric has no dynamics (the Einstein action in two 
dimensions is a total derivative), and the Euler-Lagrange equation for y yields the 
equation that the energy-momentum tensor vanishes. Quantum mechanically, these 
will be constraint equations. The components of the energy-momentum tensor are: 


1 
Tio = To. = 99 X- HX Too = Ti = 5 (ox) + (3:X)’). (21.45) 


The energy-momentum tensor is traceless. This is a consequence of conformal in- 
variance; you can show that the trace is the generator of conformal transformations. 
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In terms of the light cone coordinates, the non-vanishing components of the stress 
tensor are: 


T4 = 0,X -0,X T. =d0 X-d XxX. (21.46) 
Note that 7,_ = 7_; = 0. Energy-momentum conservation then says: 
0_T,,=0 04T__=0. (21.47) 


As a result, any quantity of the form f(x*)T,4 or f(x )T__ is also conserved. 
Integrating over the world sheet, this gives an infinite number of conserved charges. 

We want to impose the condition of vanishing stress tensor as a condition on 
states. There is a problem, however, and this is one way of understanding the origin 
of the critical dimension, 26. The obstacle is an anomaly, similar to anomalies we 
encountered in the first part of this text. One can see the problem if one takes the 
mode expansions for the X“s and works out the commutators for the 7's. We will 
show in the next section that 


[T.+(0), T+(0')] = 5(26 — D)8"(o — 0!) + i(Ty.4(6) + T4(0'))3( — 0") 
(21.48) 
and a similar equation for 7__. The first term is clearly an obstruction to imposing 
the constraint, unless D = 26. The 26 arises from the energy-momentum tensor of 
the Faddeev—Popov ghosts. Were it not for the ghosts, strings would never make 
sense quantum mechanically. One can calculate this commutator painstakingly 
by decomposing in modes. But there are simpler methods, which also provide 
important insights into string theory, which we develop in the next section. 


21.4 Conformal invariance 
The analysis of conformal invariance is enormously simplified by passing to Eu- 
clidean space. Define: 


w=tt+io w=T-io. (21.49) 


The ws describe a cylinder. Again, in this section a’ = 2. This choice will make the 
coordinate space Green functions for the X“s very simple. The Euclidean action is 
now: 


1 
S= — | dw 0yX"dgX". (21.50) 
8 


In complex coordinates, the non-vanishing components of the energy-momentum 
tensor are: 


Livis = —dyX . Ow X Tow = —0,X rs Ow X. (21.51) 
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We saw in the previous section that the string action, in Minkowski coordinates, is 
invariant under the transformations: 


ot 


> f(o*) o> go). (21.52) 


In terms of the complex coordinates, this becomes invariance under the transfor- 
mations: 


w> fiw) wo f*@). (21.53) 


These are conformal transformations of the complex variable, and, as a result of 
this symmetry, we are able to bring all of the machinery of complex analysis to bear 
on this problem. One particularly useful conformal transformation is the mapping 
of the cylinder onto the complex plane: 


z=e” Z=e", (21.54) 


Under this mapping, surfaces of constant t on the cylinder are mapped into circles 
in the complex plane; t — —oo is mapped into the origin, and t — oo is mapped 
to oo. Surfaces of constant t are mapped into circles. 

It is convenient to write our previous expression for X“ in terms of the variable 
z. First write our previous expressions: 


1 : : 
X* =x" 4+ p*tt+i SS Sener ta + qe ere)) 


n#0 
= Xi + Xk, (21.55) 
where 
1 1 . 1 —in(t—o) 
Xp = xx" + =p*(t-—a)+i a —ae (21.56) 
2 2 noo” 
1 1 1 ~in(t+o) 
Xp = nxt +aop"(tt+o)t+i 2 Se errnoe, (21.57) 
2 2 Pera 


X, is holomorphic (analytic) in z, Xp is antiholomorphic: 
aX, =—iotz "| aXp=—iatz "|. (21.58) 


where aj = & = 5p". 

Let us evaluate the propagator of the xs in coordinate space. The Xs are just 
two-dimensional quantum fields. Their kinetic term, however, is somewhat uncon- 
ventional. Because we work with units a’ = 2, so the action has a factor of 1/87 out 
front. Accounting for the extra 47r, the coordinate space-propagator is (in Euclidean 
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space): 

ak eitk 
(27)? k?2 ° 
This equation is logarithmically divergent in the infrared. We can use this to our 


advantage, cutting off the integral at scale jz and isolating the In(jz|z — z’|). The 
logarithmic dependence can be seen almost by inspection of the integral: 


(X! (a), X"(0)) = 45" / (21.59) 


(X#(z)X"(z')) = 25" In(|z — z/|) = In(z — 2’) + In(Z — 2’) + In(w?). (21.60) 


As we will see shortly, the infrared cutoff will drop out of physically interesting 
quantities, so we will suppress it in the following. 

In the covariant formulation, conformal invariance is crucial to the quantum the- 
ory of strings. To understand the workings of two-dimensional conformal invari- 
ance, we can use techniques of complex variable theory, and the operator product 
expansion (OPE). We have discussed the operator product expansion previously, in 
the context of two-dimensional gauge anomalies. It is also important in QCD, in 
the analysis of various short-distance phenomena. The basic idea is that if one has 
two operators, O(z;) and O(z2), when z; — Z2, we have 


O:@1)Oj(2)_> Y > Cije(er — 22) On). (21.61) 
1722 k 


The coefficients C;;, are, in general, singular as z; > z2. The power follows from 
dimensional analysis. 

To implement this rather abstract statement, one can insert the two operators in 
a Green function with other operators, located at some distance from z . In other 
words, one studies: 


(Oj (Z1)O jZ2)V(Z3) V4) - --). (21.62) 


One can contract the operators in O(z,) with those in O(z2), obtaining expressions 
which are singular as z} —> Z2, or with the other operators, obtaining non-singular 
expressions. The leading term in the OPE will come from the term with the max- 
imum number of operators at z; contracted with operators at z2; less singular 
operators will arise when we contract fewer operators. 

As an example, which will be useful in a moment, consider the product 
aX"(z)dX"(w). If this appears in a Green function, the most singular term as 
zZ— w will be that where we contract 0X(z) with 0X(w). The result will be like 
the insertion of the unit operator at a point, times the singular function 1/(z — w)’, 
SO We can write: 

gi 
aX"(z)dX"(w) ~ —— + ---. (21.63) 
(z — w)? 
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A somewhat more non-trivial, and important, set of operator product expansions 
are provided by the stress tensor and derivatives of X: 


T(z)0X"(w) = 0X" (z)dX"(z)dX"(w). (21.64) 


Now the most singular term arises when we contract the 0 X(w) with one of the 
0X (z) factors in T(z). The other 0 X(z) is left alone; in Green’s functions, it must 
be contracted with other more far away operators. So we are left with: 


1 1 
T(z)0X(w) * ——— dX(w) + ——#X(w) +++. (21.65) 
(z — w) = w 
Another important set of operators will turn out to be exponentials of X: 
ke As 
T(z)e'?* = —— elk 4... (21.66) 
(Z — w) 


To get some sense of the utility of conformal invariance and OPEs, let’s compute 
the commutators of the as. Start with 


d 
ot = f see" ax", (21.67) 
IT 


where the contour is taken about the origin. Now use the fact that on the complex 
plane, time ordering becomes radial ordering, So, for |z| > |w|, 


T(aX"(z)dX"(w)) = (0X"(z)daX"’(w)) (21.68) 
For |z| < |w, 
T (0X"(z)dX”(w)) = (AX (w)d X"(z)). (21.69) 


Thus we have: 
dz dw dw dz 
KB gy] — — 7" @ —w" — od —w" © — 2" | 0x" xX” 21.7 


where the contour can be taken to be a circle about the origin. In the first term, we 
take |z| > |w|. In the second, |w| > |z|. Now to evaluate the integral, do, say, the 
z integral first. For fixed w, deform the z contour so that it encircles z (Fig. 21.1). 


Then 
i CD -2 Pie Le 
eeray = w 7s 
Qn 2n° (z—w) 


v 
= Mbming”. 


Let’s return to the stress tensor. We expect that the stress tensor is the generator 
of conformal transformations, and that its commutators should contain information 
about the dimensions of operators. What we have just learned, by example, is that 
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Fig. 21.1. Contour integral manipulations used to evaluate commutators in con- 
formal field theory. 


the operator products of operators encode the commutators. We could show by the 
Noether procedure that the stress tensor is the generator of conformal transforma- 
tions. But let’s simply check. Consider the transformation: 


Z—> zt+e(z). (21.71) 
We expect that the generator of this transformation is: 
f deTiet2) (21.72) 
We take the special case of an overall conformal rescaling: 
€(Z) = Az. (21.73) 


Now suppose that we have an operator, O(w), and that 


FOG) = Ow) 4 teas sna (21.74) 
(z — w)? 
Then 
| oe AchO(w) 
E f T(z)e(Z), ow) sare f az c= ie 
= Ow). (21.75) 


This means that under the conformal rescaling, O — hO, just as we expect for an 
operator of dimension h. As an example, consider O = (0)"X. This should have 
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dimension n, and the leading term in its OPE is just of the form of Eq. (21.74), with 
h=n. 
More precisely, an operator is called a primary field of dimension d if: 
dO 


Came rau. 


T(z)O(w) = (21.76) 
Note that 0X(z) is an example; e’?* is another. However, (0)"X is not, in general, 
as the 1/(z — w) term does not have quite the right form. A particularly interesting 
operator is the stress tensor itself. Naively, this has dimension two, but it is not 
a primary field. In the operator product expansion, the most singular term arises 
from the contraction of all of the derivative terms. This is proportional to the unit 
operator. The first subleading term, where one contracts just one pair of derivatives, 
gives a piece proportional to the stress tensor itself: 


T(z)T(w) = 


ea + Gay T(w). (21.77) 
When one includes the Faddeev—Popov ghosts, one finds that they give an additional 
contribution, changing D to D — 26. 

The algebra of the Fourier modes of T is known as the Virasoro algebra, and 
is important both in string theory, conformal field theory, and mathematics. In the 
string theory, it provides important constraints on states. Define the operators: 


1 n+l 
Ly, = — PD dzz"" T(z). (21.78) 
201 
In terms of these, 
SL 
R= aa (21.79) 
m=—CoO 


and similarly for z. Because the stress tensor is conserved, we are free to choose 
any time (radius for the circle). The operator product (21.77) is equivalent to the 
commutation relations above. Proceeding as we did for the commutators of the as, 
gives: 


D 
[Ln, Lm] = (m —n)Lman + De” — M)bn4n- (21.80) 
Using expression (21.16) we can construct the L,,s: 
1 2 1 
Ln = 5 Se eae Lae ; a Be ie (21.81) 


The colons indicates normal ordering. Only when m = 0 is this significant. In 
this case, we have to allow for the possibility of a normal ordering constant. 
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This constant is related to the constant we found in the Hamiltonian in light cone 
gauge. 


(oe) 
Lo = oo netyn—a Lo = oa Gyn — 4. (21.82) 


Now we want to consider the constraint on states corresponding to the classical 
vanishing of the stress tensor. Because of the commutation relations, we cannot 
require all of the Ls annihilate physical states. We require instead: 


Lm|V) =0 (21.83) 
for m > 0. Since LI, = L_,, this insures that 
(WIL, |Y) =OVn. (21.84) 
Because of the constraint Lo = Lo, at the first excited level, we have the state: 
le) = €yvort &? |p"). (21.85) 
The L,, for n > 1, trivially annihilate the state. Form = 1 we have: 
Lile) = a €w|p”). (21.86) 
Taking into account also L;, we have the conditions: 
Die” == Dyers (21.87) 


This is similar to the condition k,,¢" familiar in covariant gauge electrodynamics 
and eliminates the negative norm states. Consider, now, Lo: 


Lole) = (p? —a + Dle). (21.88) 


So if a = 1, the constraint is p = 0, as we expect from Lorentz invariance. For 
open strings there is an analogous construction. 


21.5 Vertex operators and the S-matrix 


We have argued that, when the cylinder is mapped to the plane, the creation or 
destruction of states is described by local operators, known as vertex operators. 
In this section, we discuss the properties of these operators and their construction. 
We explain how the space-time S-matrix is obtained from correlation functions of 
these operators, and compute a famous example. 
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21.5.1 Vertex operators 


There is a close correspondence between states and operators: z — 0 corresponds 
to t — —oo. So consider, for example, 


d,X"|0), (21.89) 
as z — 0. This is: 
00 lt 
a.X(z > 0)|0) = -i De ott IO). (21.90) 
All but the term m = —1 annihilate the state to the right. Combining this with a 


similar left-moving operator creates a single particle state. 

More generally, in conformal field theories, there is a one to one correspondence 
between states and operators. This is the realization of the picture discussed in the 
introduction. By mapping the string world sheet to the plane, the incoming/outgoing 
states have been mapped to points, and the production or annihilation of particles 
at these points is described by local operators. 

The construction of the S-matrix in string theory relies on this connection of 
states and operators. The operators which create and annihilate states are known as 
vertex operators. What properties should a vertex operator possess? The production 
of the particle should be represented as an integral over the string world sheet (so 
that there is no special point along the string). The expression 


[eve (21.91) 


should be invariant under conformal transformations. This means that the operator 
should possess dimension two; more precisely, it should possess dimension one 
with respect to both the left- and the right-moving stress tensors: 


T(z)V(w, w) = ae i w)+ shay w)t--- (21.92) 
(< — w)? Z—w 


and similarly for 7. An operator with this property is called a (1, 1) operator. 

A particularly important operator in two-dimensional free field theory (i.e. the 
string theories we are describing up to now) is constructed from the exponential of 
the scalar field: 


One (21.93) 
This has dimension 


d=p" (21.94) 
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with respect to the left-moving stress tensor, and similarly for the right-moving 
part. 

With these ingredients, we can construct operators of dimension (1, 1). These 
are in one to one correspondence with the states we have found in the light cone 
construction. 


(1) The tachyon: 
Pt pr =I, (21.95) 
(2) The graviton, antisymmetric tensor, and dilaton: 
EyOX"SX%eP* p* = 0. (21.96) 
The operator product 
IX? (z)AX p(ZEu»(p)IX" (w)aX '(wye'”* (w) (21.97) 


contains terms which go as 1/(z — w)> from contracting one of the derivatives in the 
stress tensor with e!?* and one with 0X“. Examining Eq. (21.92), this leads to the 
requirement 


pew (p) = 0 (21.98) 


which we expect for massless spin-2 states. In our earlier operator discussion, this was 
one of the Virasoro conditions. 
(3) Massive states: 


Ey-pn(P)OXMAXM ..-AXHnelP* p* = 1—n. (21.99) 


Obtaining the correct OPE with the stress tensor now gives a set of constraints on 
the polarization tensor; again these are just the Virasoro constraints. Without worrying 
about degeneracies, we have a formula for the masses: 


M=n-1. (21.100) 


This is what we found in the light cone gauge. Traditionally, the states were organized 
in terms of their spins. States of a given spin all lie on straight lines, known as “Regge 
trajectories.” These results are all in agreement with the light cone spectra we found 
earlier. 


21.5.2 The S-matrix 


Now we will make a guess as to how to construct an S-matrix. Our vertex opera- 
tors, integrated over the world sheet, are invariant under reparameterizations and 
conformal transformation of the world sheet coordinates. We have seen that they 
correspond to creation and annihilation of states in the far past and far future. We 
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will normalize the vertex operators so that: 


VA) V Ch) (21.101) 
Iz — w| 
So we study correlation functions of the form: 
[ @ar..-dan( Vile, pr). Valen) (21.102) 


We will include a coupling constant, g, with each vertex operator. 
Before evaluating this expression in special cases, let’s consider the problem of 
evaluating correlation functions of exponentials: 


(e§ LP XG)), (21.103) 


An easy way to evaluate this expression is to work in the path integral framework. 
Then the exponential has the structure 


/ d?zJ,(z)X(z), (21.104) 
where 


Jul2) = >> Pind — Zi). (21.105) 
But we know that the result of such a path integral is: 


exp ( is a zd?2 Iu (2Je(zAz = 2) = exp (~ pi> pjln|(zi — zn”), 


(21.106) 
where we have made a point of restoring the infrared cutoff. 
Let’s consider the infrared cutoff first. Overall, we have a factor: 
we Por, (21.107) 


This vanishes as 4 — 0, unless )> p; = 0, i.e. unless momentum is conserved. This 
result is related to the Mermin—Wagner—Coleman theorem that there is no sponta- 
neous breaking of global symmetries in two dimensions. Translational invariance 
is a global symmetry of the two-dimensional field theory; e’?* transforms under 
the symmetry. The only non-vanishing correlation functions are translationally 
invariant. 

This correlation function also has an ultraviolet problem, coming from thei = j 
terms in the sum. Eliminating these corresponds to normal ordering the vertex op- 
erators, and we will do this in what follows (we can, if we like, introduce an explicit 
ultraviolet cutoff; this gives a factor which can be absorbed into the definition of 
the vertex operators). 
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There is one more set of divergences we need to deal with. These are associated 
with a part of the conformal invariance we have not yet fixed. The operators Lo, L, 
and L_, forma closed algebra. On the plane, they generate overall rescalings (Lo), 
translations (L;) and more general transformations (Z_;) which can be unified in 
SL(2, C), the Mébius group. It acts on the world sheet coordinates as: 


oe az’ + B 
yzit+s 
These have the feature that they map the plane once into itself. It is necessary to 


fix this symmetry and divide by the volume of the corresponding gauge group. We 
can choose the location of three of the vertex operators, say z1, Z2, Z3. These are 


(21.108) 


conventionally taken to be 0, 1, oo. It is necessary also to divide by the volume of 
this group; the corresponding factor is 


Qu = |z1 — 22/7 lz1 — 2317122 — zl’. (21.109) 


One can simply accept that this emerges from a Faddeev—Popov condition, or derive 
this following the exercises at the end of the chapter. Finally, itis necessary to divide 
by g. This insures that a three-particle process is proportional to g,, a four particle 
amplitude to g?, and so on. 

Using these results we can construct particular scattering amplitudes. While 
physically somewhat uninteresting, the easiest to examine is simply the scattering 
of tachyons. Let’s specialize to the case of two incoming, two outgoing particles. 
Putting together our results above we have (remembering that z3 — 00): 


1 
he AB z4|21 — 22/7121 — 23" [za — Za] |e3|P* PPP» | zy — Zo | PP? | zg] PHP [zg — APH P2, 

M 
(21.110) 


Using momentum conservation, the z4 independent pieces cancel out in the limit, 
and we are left with: 


A= [ @acrrme = [PPP (21.111) 
Now we need an integral table: 
I= / [z|~4|1 — z|-3a?z (21.112) 
A B A+B 
= B(1—-—,1-—, a —1). 
2 2 2 
Here B is: 
Paroe)re) 


(21.113) 
Tat+brb+ol(ct+a) 
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We can express this result in terms of the Mandelstam invariants for 2 > 2 
scattering, s = —(p; + p2)*, t = —(p2 — p3)* and u = —(p; — p4)*. Using the 
mass shell conditions, 


aft ee _— Ld oe 2 
Pa P= 5 (ut (pi —pi)) Pa: po = —(p3+p2+Pi): po = 56 s—t+2m’), 
(21.114) 
gives 
Ke 
A= ay BAS + 1, 40 + 1, du + D. (21.115) 


This is the Virasoro—Shapiro amplitude. There are a number of interesting fea- 
tures of this amplitude. It has singularities at precisely the locations of the masses 
of the string states. It should be noted, also, that we have obtained this result by an 
analytic continuation. The original integral is only convergent for a range of mo- 
menta, corresponding, essentially, to sitting “below” the threshold for the tachyon 
in the intermediate states. 

We will not develop the machinery of open string amplitudes here, but it 
is similar. One again needs to compute correlation functions of vertex opera- 
tors. The vertex operators are somewhat different. Also, the boundary conditions 
for the two-dimensional fields, and thus the Green functions, are also different. 
The scattering amplitude for open string tachyons is known as the Veneziano 
formula. 


21.5.3 Factorization 


The appearance of poles in the S-matrix at the masses of the string states is no 
accident. We can understand it in terms of our vertex operator and OPE anal- 
ysis. Suppose that particles one and two, with momenta p; and po, have s = 
(pi + p2)? = —m?, the mass-squared of one of the physical states of the system. 
Consider the OPE of their vertex operators: 

el XG) gipr X@2) ~ el (Pit P2)-X (2) 7, a Zo |-PEP2, (21.116) 


So in the S-matrix, fixing z2 = 0, z3 = 1, and zq = oo, we encounter: 
fates ie OA inte fae, O17) 


Using momentum conservation and the on-shell conditions for p; and po: 


2po-pr=q°?-8 (21.118) 
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where g = p; + po. So the z integral gives a pole, 


1 


Wes 
4—q? 


(21.119) 


i.e. it vanishes when the intermediate state is an on-shell tachyon. 

This is general. Poles appear in the scattering amplitude when intermediate states 
go on-shell. The coefficients are precisely the couplings of the external states to the 
(nearly) on-shell physical state; this follows from the OPE. 


21.6 The S-matrix vs. the effective action 


The Virasoro—Shapiro and Veneziano amplitudes are beautiful formulas. Analogous 
formulas for the case of massless particles can be obtained. These are particularly 
important for the superstring. For many of the questions which interest us, we are not 
directly interested in the S-matrix. One feature of the string S-matrix construction 
is that it involves on-shell states; the momenta appearing in the exponential factors 
satisfy Pp? = —m”, where m is the mass of the state. So one cannot calculate, for 
example, the effective potential for the tachyon, since this requires that all momenta 
vanish. For massless particles things are better, since p = 0 is the limiting case of 
an on-shell process. But the S-matrix is not precisely the effective action. Instead, 
given the S-matrix, it is usually a straightforward matter to determine a low-energy 
effective action which will reproduce it. At tree level, one just needs to subtract 
massless particle exchanges. In loops, one must be more careful. 

It is particularly easy to extract three-point couplings of massless particles at 
tree level. One just needs to study an “S-matrix” for three particles (one can be a 
bit more careful and study a four-particle amplitude, isolating the coefficient of the 
massless propagator). From our previous analysis, we need 


1 
A= q,, (MiG DValz2) Vaz), (21.120) 
M 


where we don’t integrate over the locations of the vertex operators. We are free 
to take z, and zp arbitrarily close to one another. Then the operator product will 
involve 


1 
Vi (21) Va(Z2) © C123 ———, V3(Z2). (21.121) 
Zi 23| 


The final correlation function follows from the normalization of the vertex operators 
and cancels the Mobius volume. So the net result is that g,C 123 is the coupling. 

As an example, consider the coupling of two gravitons in the bosonic string. The 
vertex operator is 


Vi = Ep y(k)OX"(Z)AX (ze *O, (21.122) 
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and similarly for V2 and V3. So the operator product has the structure: 


Vi (z)V2(w) = ig= we 


: 1 es 
+ €yvlki Epa (ka et) XO) Gis ——— aX" (z)dXP(z) +++ ) . (21.123) 


Iz — w| 

Here the first term arises from the contraction of all of the 0X terms with each 
other. Loosely speaking, it is related to the production of off-shell tachyons. We will 
ignore it. The second term that we have indicated explicitly comes from contracting 
the first 0X factor with the second exponential, and the second dX factor with 
the first exponential. The dots indicate a long set of contractions. The complete 
vertex is precisely the on-shell coupling of three gravitons in Einstein’s theory, 
along with couplings to the antisymmetric tensor and dilaton. We will not worry 
with the details here. When we discuss the heterotic string, we will show that the 
theory completely reproduces the Yang-Mills vertex in much the same way. We 
shouldn’t be surprised that it is difficult to define off-shell Green functions. In 
gravity, apart from the S-matrix, it is hard to define generally coordinate-invariant 
observables. 


21.7 Loop amplitudes 


So far, we have considered tree amplitudes. Closed or open strings interact by split- 
ting and joining. Once we allow for quantum fluctuations, strings in intermediate 
states can split and join. Because of conformal invariance, the only invariant char- 
acteristic of these diagrams is their topology (for closed strings, the tree level world 
sheet has the topology of a sphere). In the closed string case, each additional loop 
adds a handle to the world sheet. In general, the theory of string loops is com- 
plicated. But the description of one-loop diagrams is rather simple, and exposes 
important features of the theory not apparent in tree diagrams. In the case of closed 
strings, requiring that the one-loop amplitude be sensible places strong constraints 
on the theory. Invariance under certain (global) two-dimensional general coordinate 
transformations, known as modular transformations, will account for many of the 
features of both the bosonic and superstring theories. In space-time, satisfying these 
constraints is a necessary condition for the unitarity of the scattering amplitude. 
In this section, we provide only a brief introduction. We will leave for later the 
discussion of open string loops. 

The one-loop amplitude has the topology of a donut, or torus. A simple repre- 
sentation of a torus is as indicated in Fig. 21.2. In this figure, the world sheet is 
flat and of finite size. We can think of this torus as living in the complex plane. 
It is (up to conformal transformations) the world sheet appearing in the Euclidean 
path integral. The two possible periods of the donut are translated into two complex 
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/_} 


Fig. 21.2. A simple representation of a torus. 


periods, A; and A. We require that the fields are periodic under 
Ze z+maAy tnd. (21.124) 


We can transform A, and A, by a transformation in the “modular group,” SL(2, Z): 


MM _ a b nu 
(= (a) ) ow 


with a, b, c and d integers satisfying ad — bc = 1, provided we also transform the 
integers n and m by the inverse matrix: 


G)-(2 2G) ats 


Now rescale z by A,, and call tT = A2/A;. Then z has the periodicities 1 and rT. 
Under modular transformations, t transforms as: 


at+b 


Tt eyed 


(21.127) 


The modular transformations are general coordinate transformations of the world 
sheet theory, but they are not continuously connected to the identity. In order that 
one-loop string amplitudes make sense, we require that they are invariant under this 
transformation. The general amplitude will be a correlation function 


(V(z1) V(Z2) a -) torus (21.128) 


evaluated on the torus. The simplest amplitude is that with no vertex operators 
inserted. (At tree level, this amplitude vanishes owing to the division by the infinite 
Mobius volume.) For the bosonic string, we can evaluate the amplitude in light 
cone gauge. We simply need to evaluate the functional determinant. As these are 
free fields on a flat space, this is not too difficult. It is helpful to remember some 
basic field theory facts. The path integral, with initial configuration ¢;(x) and final 


21.7 Loop amplitudes 337 
configuration d(x) computes the quantum mechanical matrix element: 
(dele "7 Idi). (21.129) 


If we take the time to be Euclidean, impose periodic boundary conditions, and sum 
(integrate) over all possible ¢;, we have computed: 


Treo". (21.130) 


i.e. the quantum mechanical partition function. As described in Appendix C, this 
observation is the basis of the standard treatments of finite temperature phenomena 
in quantum field theory. In the present case, the periodicity is in the t direction. So 
we compute 


Tr eee, (21.131) 


It is convenient to rewrite the light cone Hamiltonian, He, in terms of Lo and Le 
Introducing 


que gael (21.132) 
we want to evaluate: 
Trg! gq’. (21.133) 


From any one oscillator with oscillator number n, just as in quantum mechanics, we 
obtain (1 — q”)~!; so allowing for the different values of n and the D — 2 transverse 
directions, we have: 


| [a?™a?™a _ gy PL = gy. (21.134) 


This is conveniently expressed in terms of a standard function, the Dedekind 7 
function: 


nq) =4'™* | [a —4q"). (21.135) 


n=1 


We also need the contribution of the zero modes. This is: 
(21.136) 


In the final expression, we need to integrate over t. The measure for this can be 
derived from the Faddeev—Popov ghost procedure, but it can be guessed from the 
requirement of modular invariance. It is easy to check that 

ae 


2 
T} 


(21.137) 
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is invariant. So, in 26 dimensions, we finally have 


da 
Zo f Serine’. (21.138) 
2 


Now to check that this is modular invariant, we note, first, that the full modular 
group is generated by the transformations: 


tott+tl1 to -l/t. (21.139) 


Under these transformations, as we said, the measure is invariant. The 7 function 
transforms as: 


nt +1) =e8n(t) (—1/t) = (—it)!/?n(r) (21.140) 


Since tT — T)/ Ge + oe under t + —1/t, Z is invariant. Here we see that the 
bosonic string makes sense only in 26 dimensions. 


Suggested reading 


More detail on the material in this chapter can be found in Green et al. (1987) and 
in Polchinski (1998). The light cone treatment described here is nicely developed 
in Peskin (1985). 


Exercises 


(1) Enumerate the states of the bosonic closed string at the first level with positive mass- 
squared. Don’t worry about organizing them into irreducible representations, but list 
their spins. 

(2) OPEs: explain why X” and X” do not have a sensible operator product expansion. Work 
out the OPE of 0X” and 0X” as in the text. Verify the commutator of a and a” as in 
the text. 

(3) Work out the Virasoro algebra, starting with the operator product expansion for the 
stress tensor, and using the contour method. 

(4) Mermin—Wagner—Coleman Theorem: consider a free two-dimensional quantum field 
theory with a single, massless, complex field, @. Describe the conserved U(1) symmetry. 
Show that correlation functions of the form 


(cindOn) pind) (21.141) 


are non-vanishing only if }> g; = 0. Argue that this means that the global symmetry is 
not broken. From this construct an argument that global symmetries are never broken 
in two dimensions. 

(5) Show that Qy of Eq. (21.109) is invariant under the Mobius group. You might want to 
proceed by analogy to the Faddeev—Popov procedure in gauge theories. 
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(6) Show that the factorization of tree-level S-matrix elements is general, i.e. that if the 
kinematics are correctly chosen for two incoming particles, 1 and 2, so that (p, + pr)? © 
m2, so that the amplitude is approximately a product of the coupling of particles 1 and 
2 ton, times a nearly on-shell propagator for the field n. 


22 


The superstring 


The theories we have described were motivated by thinking of a picture of a string 
moving in space-time. We arrived in this way at a description of strings in terms 
of two-dimensional quantum fields. The theories, so far, are theories of bosons 
only. But in this more abstract picture, we can imagine adding two-dimensional 
fermionic fields as well. This possibility was first considered by Ramond, Neveu 
and Schwarz, and leads to the superstring theories: Type I, Type ITA and Type IIB, 
and the two heterotic string theories. We first develop the theories in light cone 
gauge, where their spectra are readily exhibited. Then we discuss interactions. 


22.1 Open superstrings 


A priori, there appears to be a great deal of freedom in how we introduce fermions: 
their number, their representations under the (space-time) Lorentz group, and pos- 
sibly other options. Various consistency conditions restrict these choices. In the 
case of open strings, we have to introduce one fermion, yl , for each coordinate, 
X’. For the action of the fermions we take: 


1 yaork ay 
Sy = — | doi dgy". (22.1) 
20 
In two dimensions, a particularly simple choice for the y-matrices is: 
yo=o. y! =ioj (22.2) 


and the analog of ys in four dimensions is 
Y3 = 03. (22.3) 


The Dirac equation, in this basis, is purely imaginary, so we can take the fermions 
to be real (Majorana). We can work with eigenfunctions of 03: 


_ 22.4 
v (v; ; (22.4) 
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In this way, if we again introduce light cone coordinates on the world sheet, 


ot=tt+0 (22.5) 


the action becomes: 


1 


= = | Powtawt +wlaw!). (22.6) 


We need to impose boundary conditions at the string end points. To determine 
suitable boundary conditions, we vary the Lagrangian to obtain the Euler-Lagrange 
equations. The surface terms which arise in the variation involve widW4, — w_dw_. 
So the boundary terms vanish if w4 = +w_. An overall sign doesn’t matter, so we 
can take the + sign at o = 0: 


vi, 7) = ¥1@, z) (22.7) 
This leaves two choices for the boundary conditions at o = zm: 
wha, t) = +wl (x, 7). (22.8) 


Fermions which obey the boundary condition with the + sign are called Ramond 
fermions; those with the — sign are called Neveu—Schwarz (NS) fermions. Corre- 
sponding to the Ramond case are the mode expansions: 


1 : 1 , 
wi =—) diet yl = —)y die inet), (22.9) 
J/2 d, J2 dX 


In the NS case we have: 
1 ; 1 
wl ae > bee wl 2G pte yy bere, (22.10) 
V2 V2 Ss 
Now we quantize these fields: 


{W'(o, t)2, W7(o", t)4} = 15(o — ')5!78 (22.11) 


This gives, for the modes: 
(oid ao ieee (ded, pO Omen (22.12) 
The Hamiltonian in light cone gauge, for the Ramond sector, is: 
H=p’+Ny+Na. (22.13) 


Here the Ns are the various number operators: 


CO CO 
Ne= >! Oy On, Ne= > mid nam, (22.14) 
m=1 m=1 
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For the NS sector, Ny is replaced by Np: 


oe) 


Np = > mb_,b,. (22.15) 
r=1/2 


Each of these Hamiltonians has a normal ordering constant. We will determine 
these shortly. The states of the theory are the eigenstates of the fermion number 
operators bhp, d'dp, etc., for non-zero n. The eigenvalues can take the values 0 
or | in each case. The zero modes, which arise in the Ramond sector, are special. 
They give rise to space-time fermions. 


22.2 Quantization in the Ramond sector: the appearance of 
space-time fermions 


Usually, we do field theory at infinite volume, but here we are considering field 
theory at a finite volume (0 < o < 7), and this has introduced some new features. 
For the bosonic fields, X’, we have already seen that there are zero modes, which 
gave rise to the coordinates and momenta of space-time. For the fermions, we 
now have the new feature that there are two sectors, with two independent Hilbert 
spaces. It is tempting to simply keep one, but it turns out that when we consider 
string interactions, it is necessary to include both: even if we attempt to exclude, 
say, the Ramond states, they will appear in string loop diagrams. 

There is another feature: the appearance of fermion zero modes (dj) in the 
Ramond sector. These are not conventional creation and annihilation operators. 
They obey the commutation relations: 


tagvdn | =e (22.16) 


These are, up to a factor of 2, the anticommutation relations of Dirac gamma 
matrices for a D — 2-dimensional space, i.e. they are associated with the group 
O(D — 2). Anticipating the fact that D = 10, we are interested in the Dirac matrices 
of O(8). Before giving a construction of the spinor representations of O(8), let us 
first simply state the basic result: O(8) has two spinor representations, 8, and 8), and 
a vector representation, 8,, all 8-dimensional. So we can realize the commutation 
relations, not on a Fock space, but on one of the 8-dimensional representations of 
O(8). Labeling these states a, a, then 


1 
I 
Wr Yaa: 
We can construct an explicit representation for these matrices in various ways. 
A simple, and easy to remember construction, is to think of O(8) as acting on eight 


(a|do|a) = (22.17) 
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coordinates, x’. Group these into complex coordinates: 


Pex die 2eaexrtik? Paar tix® gaa tix? (22.18) 
and their complex conjugates. This defines an embedding of U(4) in O(8). Corre- 
spondingly, we define 


a! = (dj + id§), (22.19) 
etc. The a's obey the commutation relations: 
fa’, a!"} = 8%, (22.20) 


all others vanishing. These are just the conventional anticommutation relations of 
fermion creation and annihilation operators (but remember for this discussion, these 
are just matrices, and shouldn’t be confused with the d,s, which are genuinely 
creation and annihilation operators). Among products of these operators we can 
distinguish two classes: those built from an even number of as, and those built from 
an odd number. In four dimensions, the analogous distinction corresponds to the 
eigenvalue (+1) of ye 

Now we define a state, |0), annihilated by the a's. We can then form two sets of 
states, those with “even fermion number” and those with odd. The even states are: 


10) altatt}0)  aMa*tatta* 0). (22.21) 


These states form one of the 8 representations, say 8;. The second is formed by the 
states of odd fermion number. States are now labeled |p’, a, {oscillators}). 

What we have learned is that the states in the Ramond sector are space-time 
fermions; the states in the NS sector are space-time bosons. 


22.3 Type Il theory 


For closed strings, we still have two-component fields yw, but the possible choices 
of boundary conditions are somewhat different. We still require that the fermion 
surface terms vanish, but we also require that currents such as y . w i be periodic. 
(These currents are part of the generators of rotations in space-time.) So we impose 
Ramond and Neveu—Schwarz boundary conditions independently on the left and 
right movers. Indeed, we treat the left- and right-moving fermions as independent 
fields. Recalling that the Lagrangian for the fermions breaks up into left- and right- 
moving parts. They have the mode expansions: 


wv! = Soe vw! ae > bre ee) (2299) 
neZ neZ+1/2 


in the Ramond and NS sectors, respectively, and 


wv _ Nee, re = Spee (22.23) 
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The light cone Hamiltonian is now: 
H=p*+N,+Na+NatNq—a. (22.24) 


In constructing the spectrum, this must be supplemented with the condition of 
invariance under shifts in o; in the covariant formulation, this was the Ly = Lo 
constraint. 


22.4 World sheet supersymmetry 


Before considering the spectrum, we consider the question of supersymmetry. The 
theory we are considering is supersymmetric in two dimensions. Just as we decom- 
posed the fermions into left and right movers, we can introduce a two-component 


anticommuting parameter 6: 
d= a) : (22.25) 
04 


IT_ yl 4 jy 1, I 
Y'=xX'+0w ee (22.26) 


Then we define the superfield: 


We will see shortly that B/ is an auxiliary field, which in the case of strings in flat 
space, we can set to zero by its equations of motion. The supersymmetry generators 
are: 
cae 
On = = +i(¥"O) A 0a (22.27) 
004 

(we are using the capital letter A for two-dimensional spinor indices here, to dis- 
tinguish them from the letter a, which we used for O(8) spinor indices, and the 
letter w~, which we used for two-dimensional vector indices). As in four dimen- 
sions, we can introduce a covariant derivative operator which anticommutes with 
the supersymmetry generators: 


0 


D= = —iy"00, 22.28 
age) (22.28) 
In terms of the superfields, the action may be written in a manifestly invariant way: 
S= Z| @odoby"oy, 
Ar 
—] - 

= = | Pocaxtex! —ip'y*daw) — B’B"). (22.29) 

1 


Note that B/ vanishes by its equations of motion. 
Finally, note that in the NS sectors, the boundary conditions explicitly break 
the world sheet supersymmetry; they map bosonic fields into fermionic fields, 
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and vice versa, which obey different boundary conditions. The Ramond sector is 
supersymmetric. 

In the covariant formulation, this supersymmetry is essential to understanding 
the full set of constraints on the states. But it is important to stress that it is a 
symmetry of the world sheet theory; its implications for the theory in space-time 
are subtle. 


22.5 The spectra of the superstrings 


We have, so far, considered the world sheet structure of the superstring theories. We 
have not yet explored their spectra in detail. As in the case of the bosonic string, we 
will see that these theories possess a massless graviton. We will also find that they 
have a massless spin-3/2 particle, the gravitino. Consistent couplings of such a 
particle require that the space-time theory is supersymmetric. 


22.5.1 The normal ordering constants 


First, we give a general formula for the normal ordering constant. This is related to 
the algebra of the energy-momentum tensor we have discussed in Section 21.4. For 
a left- or right-moving boson, with modes which differ from an integer by n (e.g. 
modes are 1 — n, 2 — n, etc.), the contribution to the normal ordering constant is: 


1 1 
A=-—+-n(1—7). 22.30 
rae n) ( ) 


For fermions, the contribution is the opposite. So we can recover some familiar 
results. In the bosonic string, with 24 transverse degrees of freedom, we see that 
the normal ordering constant is —1. For the superstring, in the NS—NS sector, we 
have a contribution of —1/24 for each boson, and 1/24 — 1/16 for each of the eight 
fermions on the left (and similarly on the right). So the normal ordering constant 
is —1/2. For the RR sector, the normal ordering vanishes. 

There are simple derivations of this formula, whose justification requires careful 
consideration of conformal field theory. The normal ordering constant is just the 
vacuum energy of the corresponding two-dimensional free field theory. So we need 


1 CO 
f)=5 S\(n +n). (22.31) 
1 


Ignoring the fact that the sum is ill-defined, we can shift n by one, and compensate 
by a change in 7: 


1 
IM fat aU aap: (22.32) 
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If we assume that the result is quadratic in n, we recover the formula above, up 
to the constant. We can “calculate” this constant by the following trick, known as 
zeta-function regularization. For 7 = 0, we need: 


yin — lim Yon™, (22.33) 


The object on the right-hand side of this equation is ¢(s), the Riemann zeta function. 
The analytic structure of this function is something of great interest to mathemati- 
cians, but one well known fact is that its singularities lie off the real axis. Using 
integral representations, one can derive a standard result: ¢(—1) = —1/12. This 
fixes the constant as —1/24. This argument may (should) appear questionable to 
the reader. The real justification comes from considering questions in conformal 
field theory. 


22.5.2 The different sectors of the Type I theory 


In the Type II theory, there are four possible choices of boundary conditions: NS 
for both left and right movers, Ramond for both left and right movers, Ramond 
for left and NS for right and NS for left and R for right. We will refer to these 
as the NS—NS, R-R, R—NS and NS-R sectors. Consider, first, the NS—NS sector. 
There are no zero-mode fermions, so we just have a normal (unique) ground state 
for the oscillators. From our computation of the normal ordering constants in the 
previous section, we see that a = —1/2 for both left and right movers. The lowest 
state is simply the state |p). It has mass-squared —1 (in units with a’ = 2). Since 
no oscillators are excited, the Ly = Lo condition is satisfied. Now consider the first 
excited states. Again, we must have invariance under o translations, so these are 
the states: 


We pW. lB): (22.34) 


Because a = —1/2 for both left and right movers, these states are massless. The 
symmetric combination here contains a scalar and a massless spin-two particle, 
the graviton; the antisymmetric combination is an antisymmetric tensor field. At 
the next level, we can create massive states using four space-time fermions or two 
bosons, or one fermion or two bosons. 

Let’s turn to the other sectors. Consider, first, the R-NS sector, where y is 
Ramond, wv is NS. Now, the left-moving normal ordering constant is zero, while 
the right-moving constant is —1/2. So we can satisfy the level-matching condition 
(invariance under o translations) if we take the left movers to be in their ground 
state and take the right-moving NS state to be an excitation with a single fermion 
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operator above the ground state, i.e.: 
|W!) = WH pla BD. (22.35) 


From the space-time viewpoint, these are particles of spin-3/2 and 1/2. In the NS-R 
sector, we have another spin-3/2 particle. 

Just as a massless spin-two particle requires that the underlying theory be gen- 
erally covariant, a massless spin-3/2 particle, as we discussed in the context of 
four-dimensional field theories, requires space-time supersymmetry. But now we 
seem to have a paradox. With space-time supersymmetry, we can’t have tachyons, 
yet our lowest state in the NS—NS sector, |p), is a tachyon. 

The solution to this paradox was discovered by Gliozzi, Scherk and Olive, who 
argued that it is necessary to project out states, i.e. to keep only states in the spectrum 
which satisfy a particular condition. This projection, which yields a consistent super- 
symmetric theory, is known as the GSO projection. Note, first, that we have been a bit 
sloppy with the fermion indices on the ground states. We have two types of fermion 
indices, a and a, corresponding to the two spinor representations of O(8). So we do 
the following. We keep only states on the left which are odd under left-moving world 
sheet fermion number; we do the same on the right, but we include in the definition 
of world sheet fermion number the chirality of the zero-mode states. We take 


(41)" = ety” x iF Lip Yn¥—n (22.36) 


In the R-NS sector, we make a similar set of projections. Here we have a choice, 
however, in which chirality we take. If we take the opposite chirality, we get the 
Type IIA theory; if we take the same chirality, we get the Type IIB theory. 

Returning to the NS—NS sector, we make a similar projection, keeping only states 
which are odd under both left- and right-moving fermion number. In this way we 
eliminate the would-be tachyon in this sector. 

Somewhat more puzzling is the R-R sector in each theory. Here both the left- and 
right-moving ground states are spinors. So in space-time, the states are bosons. We 
can organize them as tensors by constructing antisymmetric products of y-matrices, 
yJ‘~. As we know from our experience in four dimensions, these form irreducible 
representations, in this case of the little group O(8). Thinking of our construction 
of the y-matrices in terms of the as, we can see ys with even numbers of indices 
connect states of opposite chirality, while those with odd numbers connect states 
with the same chirality. Which tensors appear depends on whether we consider the 
IIA or IIB theories. In the IIA case, only the tensors of even rank are non-vanishing. 
These tensors correspond to field strengths (one can consider an analogy with the 
magnetic moment coupling in electrodynamics, wy’ w). So in the IIA theory, one 
has second- and fourth-rank tensors; the sixth- and eighth-rank field strengths are 
dual to these. In terms of gauge fields, there is a one-index tensor (a vector), and a 
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third-rank antisymmetric tensor. In the IIB theory, there is a scalar, a second-rank 
tensor and a fourth-rank tensor. In string perturbation theory, because the couplings 
are through the field strengths, there are no objects carrying the fundamental charge. 
Later we will see that there are non-perturbative objects, D-branes, which carry 
these charges. 


22.5.3 Other possibilities: modular invariance and the GSO projection 


The reader may feel that the choices of projections, and for that matter the choices 
of representations for the two-dimensional fermions, seem rather arbitrary. It turns 
out that the possible choices, at least for flat background space-times, are highly 
restricted. There are only a few consistent theories. Those we have described are the 
only ones without tachyons, and with both left- and right-moving supersymmetries 
on the world sheet. 

In the bosonic string theory, we saw that it was crucial that the theory be formu- 
lated in 26 dimensions. One of the problems with the theory outside of 26 dimen- 
sions was that it is not modular invariant. This means that it is not invariant under cer- 
tain global two-dimensional general coordinate transformations. This world sheet 
anomaly is correlated with anomalies in space-time. As for the gauge anomalies in 
field theories, these lead to breakdown of unitarity, Lorentz invariance, or both. 

For the superstring theories, we will now explain why modular invariance de- 
mands a projection like the GSO projection. The point is that modular invariance 
relates sectors with different choices of boundary condition. 

In our discussion of string theories up to this point, path integrals have appeared 
occasionally, but they are extremely useful in discussing string perturbation theory. 
The propagation of strings can be described by a two-dimensional path integral, with 
the string action, in much the same way as the amplitude for the motion of a particle. 
At tree level, the closed string world sheet has the topology of a sphere. At one loop, 
it has the topology of a torus. So at one loop, string amplitudes can be described 
as path integrals of a two-dimensional field theory on a torus. Note that we need, 
here, the full path integral, not simply the generator of Green’s function for the field 
theory. The path integral on the torus, with no insertion of vertex operators, yields 
the partition function of the two-dimensional field theory. To understand this, let’s 
consider the fermion partition function. Actually, there are several fermion partition 
functions. Let’s begin with a single, right-moving Majorana fermion, and take, first, 
Neveu—Schwarz boundary conditions. There are two sorts of partition function we 
might define. First: 


Trg = I] fda’): (22.37) 
r=1/2 
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Alternatively, we can evaluate: 


Tr(—1)"q” = [|] -q’). (22.38) 
r=1/2 


From a path integral point of view, the first expression is like a standard thermal 
partition function. It can be represented as a path integral with anti-periodic bound- 
ary conditions in the time direction. The second integral corresponds to a path 
integral with even boundary conditions for fermions in the time direction. We can 
represent the torus as in Fig. 21.2. Taking the vertical direction to be the time direc- 
tion and the horizontal direction the space direction, we can indicate the boundary 
conditions with plus and minus signs along the sides of the square. Recalling the 
action of modular transformations on the torus, however, we see that the modular 
group mixes up the various boundary conditions. Not only does it mix the temporal 
boundary conditions, but it mixes the spatial boundary conditions as well. 

It will be convenient for much of our later analysis to group the fermions in 
complex pairs. In the present case, this grouping is rather arbitrary, say WV! = 
w! +i? and so on. Then the partition functions can be conveniently written in 
terms of 6 functions. These functions, which have been extensively studied by 
mathematicians, transform nicely under modular transformations: 


0 . 00 | 
v (0, tT) = n(t) e219 g@?/2-1/24 I] [1 zo eign) 


d m=1 
[Lae ge a |, (22.39) 
Under t > t+ 1, 
0 ‘| (0,t +1) = ei 9-9 9 e (0, T) (22.40) 
p ~—8 
while, under t > —1/T, 
0 H (0, 1/t) = e749 Pe | (0, t). (22.41) 


These transformation properties have a physical interpretation. Returning to 
Eqs. (21.125)-(21.127), the transformation t — —1/t exchanges the time and 
space directions of the torus. So these transformations interchange sectors with a 
given projection (multiplication of states by a phase) with states with a twist in 
the space direction. This is precisely what one would expect from a path integral, 
where boundary conditions in the time direction correspond to weighting of states 
with (symmetry) phases. 
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Calling 


7 1 a/2 
22.42 
Z3(T) = re ale (0, T) ( ) 


the partition function for the eight fermions in the NS sector is (Z5y, for example. 
If we include a(—1)” factor, this is replaced by (Z es We can work out the similar 
expressions for the Ramond sector. From our expression for the transformation of 
the @ functions, it is clear that no one of these is modular invariant by itself, as 
we would expect from our path integral arguments. So it is necessary to combine 
them, and include also the eight bosons. When we do, we have the possibility of 
including minus signs (in more general situations, as we will see later, we will have 
more complicated possible phase choices). There are a finite number of possible 
choices. Two that work are: 


Zi = ; [Z9(z)* — ZP(t)* + Zh(t)* F Zo)". (22.43) 


These transform simply under the modular transformations; all of the terms trans- 
form to each other, up to an overall factor. There is a similar factor from the 
left-moving fermions (where one need not, a priori, take the same phase). Recall 
that the bosonic partition function is 


Zx(t) = (40/7 |\n(q) |. (22.44) 


Here the n function comes from the oscillators. The t2 factors come from the 
integration over the momenta. There are two additional such factors, coming from 
the integrals over the two light cone momenta. So the full partition function is: 


lee 42 
z=c f Szyzrwztoy. (22.45) 
T) 


It is not hard to check that this expression is modular invariant. 
If we examine the partition function carefully, we see that we have uncovered 
the GSO projection. Consider the first two terms in Z~. This is just 


Tr(1 — (—1)")ns, (22.46) 


Le. it says that the physical states of the theory, in the NS sector, are only those of 
odd fermion number. There is a similar projector in the Ramond sector. The two 
possible choices of left- relative to right-moving Zs correspond precisely to the 
two possible supersymmetric string theories. Our original argument for the GSO 
projector was consistency in space-time, but here we have a more direct, world 
sheet consistency argument. 
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Fig. 22.1. Deforming the diagram for open string scattering reveals an intermediate 
closed string state. 


These are the only choices of phases which lead to supersymmetric strings in ten 
dimensions. However, there are other choices which lead to non-supersymmetric 
strings. These give the Type 0 superstring. We will leave consideration of these 
theories to the exercises. 


22.5.4 More on the Type I theory: gauge groups 


In our discussion of the bosonic string theory, we mentioned that one can obtain 
non-Abelian gauge groups by allowing charges at the ends of the strings. There 
are an infinite set of possibilities, which we did not explore, as all of these theories 
have other problematic features if one is trying to describe Nature. 

In the case of open superstrings, it turns out that the possible structures are quite 
constrained. First, it is necessary to include closed strings as well in order to obtain 
a unitary theory. This can be seen by considering scattering of four open strings. 
By stretching the diagram of Fig. 22.1, one can see that closed strings appear 
in intermediate states. These strings cannot be oriented. This leads to a different 
structure in the closed string sector than we saw in the IIA or IIB theories. It is 
necessary to require that states be symmetric under exchange of left- and right- 
moving quantum numbers. We will discuss the required projection later when we 
talk about D-branes and orientifold planes. 

Second, it turns out that absence of anomalies fixes uniquely the gauge symmetry 
to be O(32). From the point of view of our experience with four-dimensional 
anomalies, this is somewhat surprising, but it turns out that in ten dimensions 
supergravity by itself can be anomalous, and this is the case for the open string. 
Allowing for charges at the end of the string, leads to a set of additional mixed 
gauge and gravitational anomalies. Almost miraculously, if one takes the ends of 
the string to lie in the vector representation of O(32), all anomalies cancel. 
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22.6 Manifest space-time supersymmetry: the Green—Schwarz formalism 


In the Ramond—Neveu—Schwarz formalism, space-time supersymmetry is obscure. 
It only arises after imposing the GSO projector. The supersymmetry operators must 
connect the different sectors — essentially different two dimensional field theories. 
These operators can be constructed, though we will not do that in this text. Instead, 
we consider in this section a different formalism, the Green—Schwarz formalism, in 
which the space-time supersymmetry is manifest. This formalism is best understood 
in the light cone gauge. 

In the Green—Schwarz formalism, one still has the bosonic coordinates, X/, 
but the eight fermionic coordinates, y! , in the vector representation of O(8), are 
replaced by eight fermionic coordinates in one of the spinor representations of O(8) 
(we have already seen that O(8) possesses two spinor representations of opposite 
chirality). These are usually written as S“(o, t). Their Lagrangian is: 


ae 
C= 5S" 0" 8,5, 22.47 
ee eo ( ) 


where we have written the Ss as two component fermions, and o% denotes the 
two-dimensional y-matrices. The S,s can be taken real (Majorana). They can be 
decomposed into left and right movers, S. Unlike the case of RNS fermions, both 
for closed and open strings, one has only one boundary condition. As for the RNS 
fermions, for open strings, the boundary condition relates the left and right movers: 


SL, +) = S200, t) SiG, t) = S2(, tr). (22.48) 
For the closed strings, one simply has periodicity, 


Si(o +7, t) = S4(6, 7). (22.49) 


The mode expansions, in the case of closed strings, are: 


oo) 
S2 Hi eS Se ee 
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oe) 
SE a Beer (22.50) 


—C 
The S,,s obey the anticommutation relations: 
Sec S Va Or One [See ore. (22.51) 


For non-zero n these are canonical fermion creation and annihilation operator an- 
ticommutation relations. Because of their quantum numbers, the Ss, acting on 
space-time bosonic states, produce fermionic states, and vice versa. 
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The light cone Hamiltonian, in terms of these fields, takes the form: 


H= «ply +N+N), (22.52) 
2pt 


N=)- (a!,,04, + mS4,,58) N= mA a! a), +mS%,,8%). (22.53) 


Note that there is no normal ordering constant; more precisely, the normal ordering 
constants associated with the left- and right-moving fields vanish, because the 
contributions of the bosonic and fermionic fields cancel (as they do in the Ramond 
sector of the superstring). 

As in the Ramond sectors of the superstring theories, the anticommutation rela- 
tions of the zero modes are important and interesting: 


era eee ae (22.54) 


Again they are similar to the anticommutation relations of Dirac y-matrices, but 
now the indices are different than in the RNS case. The solution is to allow So to 
act on sixteen states, eight of which carry spinor labels, b, and eight of which carry 
O(8) vector labels, 7. Then 


(Seba (22.55) 


We’ II leave the verification of this relation for the exercises, and proceed directly 
to the identification of the massless states of the closed string theories. The ITA and 
IIB theories are distinguished by the relative helicities of the S and S fields. In the 
IIA case, they are opposite; in the IIB case the same. The massless fields are just 
obtained by tensoring the left and right states of the zero modes. The states 


€zs|1) x |J) (22.56) 


are the graviton, B-field and dilaton; the states where J — a or J — a, are the two 
gravitini of the theory; those where both J and J are replaced by spinor indices are 
the states we discovered in the Ramond—Ramond sector of the superstring theories. 
In this formalism, the space-time supersymmetry is manifest. There are two types 
of supersymmetry generators. One generates not only space-time supersymmetries, 
but world sheet supersymmetries as well. This is as it should be; the world sheet 
Hamiltonian in the light cone gauge is also the space-time Hamiltonian. 


Q* = 


De Sia. (22.57) 
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The second set are built of the zero modes alone: 


Q* = V2P*S5. (22.58) 
The supersymmetry generators obey the commutation relations: 
(0°, O°} = 2P*5” (22.59) 
(0°, O°} = V2y1,P!. (22.60) 
(0%, O°} = 2H8”. (22.61) 


The manifest supersymmetry and the close connection between world sheet and 
space-time supersymmetries makes the Green—Schwarz formalism a powerful 
tool, both conceptually and computationally, despite its lack of manifest Lorentz 
invariance. 


22.7 Vertex operators 


Because there are more world sheet fields in the superstring than in the bosonic 
string, the vertex operators are more complicated. In the RNS formalism, the 
supersymmetry on the world sheet is a relic of a larger, local supersymmetry, 
much as conformal invariance is a relic of the general coordinate invariance of 
the two-dimensional supersymmetry. The resulting superconformal symmetry pro- 
vides constraints on vertex operators beyond those of the Virasoro algebra. These 
constraints can be implemented in a variety of ways, depending on how one treats 
the superconformal ghosts. In the simplest version, the vertex operators must be 
supersymmetric. In the case of the Type II theories, the vertex operators must re- 
spect both the left- and right-moving supersymmetries. For the massless fields of 
the Type II theory, for example: 


V = €,,(0X" — kp? w")(AX” — ike web ye*. (22.62) 


Here € is subject to the constraint k“€,,, = 0. Depending on the symmetries of €, 
the vertex operator describes production of gravitons, dilatons, or antisymmetric 
tensor fields. It is straightforward to check that the coupling of three gravitons is 
that expected from the Einstein Lagrangian. 

In the Green—Schwarz formalism, it is Lorentz invariance which governs the 
form of the vertex operators. As in the covariant formulation, the vertex operators 
in the Type II theory are products of separate vertex operators for the left and the 
right movers, with e’** factors. These have the structure: 


Va = Cyy BY Br el®*, (22.63) 
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where 
Bi = axX'—RVki Bt = pt (22.64) 


and, from the light cone gauge condition, ¢“* = 0. Here, 
Ray sas. (22.65) 


In the Green—Schwarz approach, it is not more difficult to deal with vertex operators 
for fermions or for what we have, in the covariant formulation, called the R-R states. 
The polarizations, ¢,,,, are replaced by polarizations with one or two spinor indices. 
Then, as appropriate, one replaces the B“s with fermionic operators, F“ and F“. 
We will not give these here as we will not need them in the text, but they can be 
found in the references. In the covariant approach, more conformal field theory 
machinery is required to construct fermion emission operators. 


Suggested reading 


The superstring is well treated in various textbooks. Green et al. (1987) focus heavily 
on the light cone formulation; Polchinski (1998) focuses on the RNS formulation. 
Both provide a great deal of additional detail, including construction of vertex 
operators and S-matrices in the two formalisms. A concise and quite readable 
introduction to the problem of fermion vertex operators in the RNS formulation is 
provided by the lectures of Peskin (1987). 


Exercises 


(1) Consider the R-R sectors of the ITA and IIB theories, and study the objects 


ay Kony, 
Show that in the ITA case, only even-rank tensors are non-vanishing, while in the ITB 
theory only the odd-rank tensors are non-vanishing. Phrase this in the language of ten 
dimensions, rather than the eight light cone dimensions. To do this consider a particle 
moving along the 9 direction, and show that the Dirac equation correlates chirality in 
ten dimensions with chirality in eight. To do this, you may want to make the following 
choice of I’ matrices: 


Tr? = 02 @ Io; VT! =io, @ y'; T? = i103 @ Ne. (22.66) 


(2) Write the Green—-Schwarz Lagrangian in a superspace formulation. Show that Q is the 
supersymmetry generator expected in this approach. Construct the symmetry generated 
by Q*%, and show that this has the structure of a non-linearly realized (spontaneously 
broken) supersymmetry. Can you offer some interpretation? 
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(3) Verify that with the choice of Eq. (22.55), the zero modes of the Green—Schwarz oper- 
ators S“ obey the correct anticommutation relations. 

(4) Verify the expression for the partition function for the Type II theories. Show that it is 
modular invariant. Consider a different choice, which defines the type 0 superstring, 


[29° + [29° + [Za]? [Zt (22.67) 


If you like, verify that this is also modular invariant, but at least show that the spectrum 
does not include a spin-3/2 particle. 

(5) Verify that the operator product of two graviton vertex operators in the RNS formalism 
yields the correct on-shell coupling of three gravitons. Remember the gauge condition in 
this analysis. The three-graviton vertex in Einstein’s theory can be found, for example, 
in Sannan (1986). 
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The heterotic string 


In the Type II theory, we have seen that the left and right movers are essentially 
independent. At the level of the two-dimensional Lagrangian, there is a reflection 
symmetry between left and right movers. However, this symmetry does not hold 
sector by sector; it is broken by boundary conditions and projectors. 

In the heterotic theory, this independence is taken further, and the degrees of 
freedom of the left and right movers are taken to be independent — and different. 
There are two convenient world sheet realizations of this theory, known as the 
fermionic and bosonic formulations. In both, there are eight left-moving and eight 
right-moving X/s, associated with ten flat coordinates in space-time. There are 
eight right-moving two-dimensional fermions, y/. There is a right-moving super- 
symmetry, but no left-moving supersymmetry. In the fermionic formulation there 
are, in addition, 32 left-moving fermions which have no obvious connection with 
space-time, 4“. In the bosonic description, there are an additional 16 left-moving 
bosons. In other words, there are 24 left-moving bosonic degrees of freedom. There 
are actually several heterotic string theories in ten dimensions. Rather than attempt a 
systematic construction, we will describe the two supersymmetric examples. These 
have gauge group O(32) and Eg x Eg. The group Es, one of the exceptional groups 
in Cartan’s classification, is not terribly familiar to most physicists. However, it is 
in this theory that we can most easily find solutions which resemble the Standard 
Model. We will introduce certain features of Eg group theory as we need them. 
More detail can be found in the suggested reading. In this chapter, we will work 
principally in the fermionic formulation. We will develop some features of the 
bosonic formulation in later chapters, once we have introduced compactification of 
strings. 
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23.1 The O(32) theory 


The O(32)(SO(32)) theory is somewhat simpler to write down, so we develop 
it first. In this theory, the 32 44 fields are taken to be on an equal footing. The 
GSO projector, for the right movers, is as in the superstring theory. In the RNS 
formalism, in the NS sector, we keep only states of odd fermion number; similarly 
in the Ramond sector, where fermion number includes a factor e’"''. For the left 
movers, the conditions are different. Again, we have a Ramond and an NS sector. 
In the NS sector we keep states only of even fermion number. In the R sector, the 
ground state is a spinor of SO(32). The spinor representation can be constructed 
just as we constructed the spinor representation of O(8). Again, there are two 
inequivalent irreducible representations. There is a chirality, which we can call 133. 
The lowest spinor representation of definite chirality is the 32 768. Again, in the 
Ramond sector, we project (by convention) onto states of even “fermion number.” 

As for the superstring, there is a different light cone Hamiltonian for each sector. 
The right-moving contributions are just as in the superstring. The left-moving part 
includes a contribution from the bosonic operators, and a contribution from the 
fermions, 4“. As for the superstring, in the Ramond sector the As are integer 
moded; they are half-integer moded in the NS sector. From our formula, the left- 
moving normal ordering constant is —1. 

With this, we can consider the spectrum. Take, first, the NS—NS sector, i.e. the 
sector with NS boundary conditions for both the left and the right movers. The states 
are space-time bosons. The left-moving normal ordering constant is —1. Without 
AAs, the lowest mass states we can form are: 


a! W210). (23.1) 


From our discussion of the normal ordering constants, we see that these states are 
massless. They have the quantum numbers of a graviton, antisymmetric tensor, and 
scalar field. 

Using the left-moving fermion operators, we can construct additional massless 
states in this sector: 


122 1202 1/210). (23.2) 


These are vectors in space-time. Because the \“s are fermions, they are antisym- 
metric under A <> B. So they are naturally identified as gauge bosons of the gauge 
group SO(32). We will show shortly that they have the couplings of O(32) Yang— 
Mills theories. 

Let’s first consider the other sectors. In the NS-R sector, the right-moving states, 
w! 1 y2lP). are replaced by the states we labeled |a). Again, these must be massless, 
so we now have particles with the quantum numbers of the gravitino, one additional 
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fermion, and gauginos of O(32). In the NS—R and R-R sectors, however, it turns out 
that there are no massless states, as can be seen by computing the normal ordering 
constants. It is necessary to include, as well, the R sector for the left movers. Here 
the normal ordering constant is +1, and there are no massless states. 


23.2 The Eg x Eg theory 


The Eg group is unfamiliar to many physicists, and one might wonder how one 
could obtain two such groups from a string theory. To begin, it is useful to note 
that E's has an O(16) subgroup. Under this group, the adjoint of E's, which is 248- 
dimensional, decomposes as a 120 — the adjoint of O(16) — and a 128, a spinor of 
O16). 

In ten dimensions, we have seen we can build a sensible string theory with eight 
left-moving bosons and 32 left-moving fermions. So the strategy is to break the 
fermions into two groups of 16, 44 and 4, and to treat these as independent. 
This gives a manifest O(16) x O(16) symmetry, similar to the symmetry of the 
O(32) theory. There are now NS and R sectors for each set of fermions separately. 
The right-moving GSO projectors are as before. For the left movers, in each of 
the NS sectors, the left-moving projector is onto states of even fermion number. 
With a suitable convention for the I';; chirality, this is also true of the R sectors. 
So consider, again, the spectrum. In the NS—NS-NS sector, just as before, there 
are a graviton, antisymmetric tensor, and scalar field. We can also construct gauge 
bosons in the adjoint of each of the two O(16)s: 


SARE WE yel0): “RA AP ee pO): (23.3) 


Note that because of the projectors, there are no massless states carrying quantum 
numbers of both O(16) groups simultaneously. In the NS—NS-R sector, we find 
the superpartners of these fields. 

Now consider the R-NS-NS sector. Here the ground state is a spinor of the 
first O(16). So now we have a set of gauge bosons in the spinor 128-dimensional 
representation. Similarly, in the NS—R-NS sector, we have a spinor of the other 
O(16). These are the correct set of states to form the adjoints of two Egs. Again, 
establishing that the group is actually Eg x Eg requires showing that the gauge 
bosons interact correctly. We will do that in the following section. 

Finally, in the R-R—NS and R-R-R sectors, there are no massless states. 


23.3 Heterotic string interactions 


We would like to show that the states we have identified as gauge bosons in the 
heterotic string interact at low energies as required by Yang—Mills gauge invariance. 
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To do this, we work in the covariant formulation and construct vertex operators cor- 
responding to the various states. Consider the O(32) theory first. With our putative 
gauge bosons, we associate the vertex operators: 


| i a / d?zd4A(Z)A(Z) (8. X*"(z) — ikv wh w'(ze™*. (23.4) 


For the right movers, as in the Type II theories, we have required invariance under 
the right-moving world sheet supersymmetry. For the left-moving vertex operators, 
we have simply required that the operators have dimension one, so that overall 
the vertex operator has dimension one with respect to the left- and right-moving 
conformal symmetry (the operator is said to be (1, 1), just like those of the Type II 
theory). To determine their interactions, we study the operator product of two such 
operators. The left-moving part of the vertex operator is a current: 


G7 @ = M@a*@). (23.5) 
The operator product of two of these currents is: 
SAC 5BD AO ica 640) 8 (ZAP (D) 
(Z— wy (Z— w) 
An algebra of currents of this kind is called a “Kac—Moody algebra.’ It has the 
general form 


Via (4) ae (23.6) 


kse? abe +c( 
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where k is called the central extension of the algebra. In our case, k = 1. The {7s 
are the structure constants of the group. This is what we have found here. 

To see the Yang—Mills structure, it is helpful to use the general Kac-Moody form, 
denoting the currents, and the corresponding vertex operators, by a subscript a. In 
the operator product, we have seen from our discussion of factorization that the in- 
teraction is proportional to the coefficient of 1/|z — w|?. In the product V,(z)V,(w), 
the 1/(Z — w) is proportional to faye, just what is needed for the Yang—Mills ver- 
tex. The momentum and g,,, pieces arise from the right-moving operator product. 
In 


(OX (z) + kip? (zw (ze *OAX"(w) + kag UW (ww) ye *™ (23.8) 


the 1/(z — w) terms arise from various sources. One can contract the 0X factors in 
each vertex with the exponential factors. This gives 


pane ve (kt = ki) 


Iz — w/? 


Vive ~ (23.9) 
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Contracting the two 0X factors with each other gives two factors of z — w in 
the denominator. These can be compensated by Taylor expanding X(z) about w. 
Additional terms arise from contracting the fermions with each other. The details 
of collecting all the terms and comparing with the three gauge boson vertex are left 
for the exercises. 


23.4 A non-supersymmetric heterotic string theory 


One can verify the modular invariance of the heterotic string theory, with the GSO 
projections we have used, in precisely the same way as we did for the superstring 
theories. This raises the question: are there other ten-dimensional heterotic theories, 
obtained by combining the partition functions of the separate sectors in different 
ways? The answer is definitely yes. Several of these have tachyons, but one does 
not. Its gauge group is O(16) x O(16). It is most readily described in the Green— 
Schwarz formalism. This will also provide us with our first example of “modding 
out,” obtaining a new string theory by making various projections. 

In order to obtain the smaller gauge group, we need to get rid of the gauge bosons 
from Es which lie in the spinor representation. On the other hand, there is no harm 
in having the corresponding gauginos, if supersymmetry is broken. So we take the 
original Eg x Ex theory, and keep only states which are even under the symmetry 
(—1)* in spacetime and a corresponding symmetry in the gauge group (i.e. spinorial 
representations are odd, non-spinorial even). This immediately gets rid of: 


(1) the gravitinos, and 
(2) the gauge bosons which are in spinorial representations of the group. 


However, we have seen that, for consistency, it is important that string theories 
be modular invariant. Simply throwing away states spoils modular invariance; it is 
necessary to add in additional states. In the present case, one has to add a sector 
with different, twisted boundary conditions for the fields: 


Sa(o +7,T) = —S,(o, T). (23.10) 


For the gauge fermions there is a related boundary condition (this is more eas- 
ily described in the bosonic formulation which we will discuss in the chapter on 
compactification). 


Suggested reading 


The original heterotic string papers by Gross ef al. (1985, 1986) are remarkably 
clear. Polchinski’s book (1998) provides a quite thorough overview of these theories. 
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For example, for those who are not enamored of the Green—Schwarz formalism, it 
develops the non-supersymmetric O(32) in the RNS formalism in some detail. 


Exercises 


(1) Construct the states corresponding to the gauge bosons of EF x Eg. In particular, use the 
creation—annihilation operator construction of O(2N) spinor representations to build 
the 128s of O(16). 

(2) Verify that the algebra of O(32) currents is of the Kac-Moody form. To work out the 
structure constants, remember that the generators of O groups are just the antisymmetric 
matrices: 


Ong Harr? ao, (23.11) 


(3) Verify that, on-shell, the three-gluon vertex has the correct form. In addition to carefully 
evaluating the terms in the operator product expansion, it may be necessary to use 
momentum conservation and the transversality of the polarization vectors. 


24 


Effective actions in ten dimensions 


In ten dimensions, supersymmetry greatly restricts the allowed particle content and 
effective actions for theories with massless fields. Without gauge interactions, there 
are only two consistent possibilities. These correspond to the low-energy limits of 
the IIA and IIB theories. These have N = 2 supersymmetry (they have 32 con- 
served supercharges). Because the symmetry is so restrictive, we can understand a 
great deal about the low-energy limits of these theories without making any detailed 
computations. We can even make exact statements about the non-perturbative be- 
havior of these theories. This is familiar from our studies of field theories in four 
dimensions with more than four supercharges. In ten dimensions, supersymmetric 
gauge theories have N = 1 supersymmetry (16 supercharges). Classically, speci- 
fication of the gauge group completely specifies the terms in the effective action 
with up to two derivatives. Quantum mechanically, only the gauge groups O(32) 
and Es x Es are possible. 


24.0.1 Eleven-dimensional supergravity 


Rather than start with these ten-dimensional theories, it is instructive to start in 
eleven dimensions. Eleven is the highest dimension where one can write a super- 
symmetric action (in higher dimensions, spins higher than two are required). This 
fact by itself has focused much attention on this theory. But it is also known that 
eleven dimensions has a connection to string theory. As we will see later, if one 
takes the strong coupling limit of the Type IIA string theory, one obtains a theory 
whose low-energy limit is eleven-dimensional supergravity. 

The particle content of the eleven-dimensional theory is simple: there is a gravi- 
ton, guy (44 degrees of freedom) and a three-index antisymmetric tensor field, 
Cuno (84 degrees of freedom). There is also a gravitino, yy. This has (16 x 8) 
degrees of freedom. We have, as usual, counted degrees of freedom by considering 
a theory in nine dimensions, remembering that gy, is symmetric and traceless, 
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and that the basic spinor representation in nine dimensions is sixteen dimensional 
(it combines the two eight-dimensional spinors of O(8)). 

The Lagrangian for the eleven-dimensional theory, in addition to the Ricci scalar, 
involves a field strength for the three-index field, Cyyo. The corresponding field 
strength, Fynop, is completely antisymmetric in its indices, similar to the field 
strength of electrodynamics: 


3! 
Funop = 4 (OmCnop —dnCuopt:::) 


3! 
= FAD? auCwor. (24.1) 
“Pp 


where the sum is over all permutations, and the factor (—1)” is +1 depending on 
whether the permutation is even or odd. It is convenient to describe such antisym- 
metric tensor fields in the language of differential forms. For the reader unfamiliar 
with these, an introduction is provided later, in Section 26.1. For now, we note that 
antisymmetric tensors with p indices are p forms. The operator of taking the curl, 
as in Eq. (24.1), takes a p form to a p + 1 form. It is denoted by the symbol d, and 
is called the exterior derivative. In terms of forms, Eq. (24.1) can be written quite 
compactly as 


F=dC. (24.2) 


The theory has a gauge invariance: 
2 P 
C>C+dA Cuno > = S\(-D? du Ano (24.3) 
‘Pp 


where A is a two-form. 
We will not need the complete form of the action. The bosonic terms are: 


1 af 2k 
= ag V8 Fn eM Fu iy FMs...MgCMo=-Mi1- 
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(24.4) 
The last term is a Chern—Simons term. It respects the gauge invariance of Eq. (24.3) 
if one integrates by parts. Such terms can arise in field theories with odd dimensions; 
in 2+1-dimensional electrodynamics, for example, they play an interesting role. 
The fermionic terms include covariant derivative terms for the gravitino, as well as 
couplings to F and various four-Fermi terms. The supersymmetry transformation 
laws have the structure: 


1 
52 V8R 


Loos = “ped 


be4, = Sawa (24.5) 
fo) 
dAunpe = og Man be) (24.6) 


1 
6Wu = —Dun-+(F7 pieces). (24.7) 
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Here en is the “vielbein’” field, and the covariant derivative is constructed from the 
spin connection (discussed in Section 17.6). 


24.0.2 The ITA and IIB supergravity theories 


The eleven-dimensional fields are functions of the coordinates xo, ... , X19. We ob- 
tain the IIA supergravity theory (the low-energy limit of the Type IIA string) theory 
if we truncate the eleven-dimensional supergravity theory to ten dimensions, i.e. 
if we simply eliminate the dependence on x19. We need to relabel fields, as well, 
since it is not appropriate to have a 10 index. So we take the components of g 
with ten-dimensional indices to be the ten-dimensional metric. Then gj 19 is a 
ten-dimensional scalar, which we call @, and gio ,, is a ten-dimensional vector, 
which corresponds to the Ramond—Ramond vector of the IIA string theory. Note 
that C11 pv = Byy is atwo-index antisymmetric tensor field in ten dimensions (cor- 
responding to the two-index tensor we found in the NS—NS sector). The gravitino 
decomposes into two ten-dimensional gravitinos, and two spin-1/2 particles. With 
H = dB, the bosonic terms in the ten-dimensional action for the NS—NS fields are: 
ao pe ee ee 

2K? 4 He ~ VOR 

The IIB theory is not obtained in this way. But from string theory, we can see 
that the NS—NS in the action must be the same as in the Type IIA theory. This is 
because in the NS-NS sector, the vertex operators of the ITA and IIB theories are 
the same, so the scattering amplitudes — and hence the effective action — are the 
same as well. 


Li (3,.6/o)- (24.8) 


24.0.3 Ten-dimensional Yang-Mills theory 


From our studies of the heterotic string, we know the field content of this theory. 
There is a metric, an antisymmetric tensor field (which we again call B,,,,), a scalar 
@, and the gauge fields, Aj. The Lagrangian for g, B and ¢ is the same as in the 
Type II theories. The gauge terms are: 


1 
ey ee 5 "(Dik (24.9) 


It turns out that there is another crucial modification in the Yang—Mills case. The 
field strength Hy yo is not simply the curl of By, but contains an additional piece, 
which closely resembles the Chern—Simons term we encountered in our study of 
instantons in four-dimensional Yang—Mills theory: 


K 
Heaps (24.10) 
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(the notation will be thoroughly explained in Chapter 26), with 
1 2 
w3 = A“ F* — 38 fabcAt A’ AS = A“d AS + 38 Tae A AG (24.11) 


There is also a gravitational piece, with a similar form. 

This extra term plays an important role in understanding anomaly cancellation. 
In four dimensions, we will see that it leads to the appearance of axions in the 
low-energy theory. 


24.1 Coupling constants in string theory 


The Standard Model is defined, in part, by specifying a set of coupling constants. 
The fact that there are so many parameters is one of the reasons we have given 
that the model is not satisfactory as some sort of ultimate description of nature. 
In our discussion of string interactions, we have introduced a coupling constant, 
gs. There is one such constant for each of the string theories we have introduced: 
bosonic, Type I, Type ITA and Type IIB, and heterotic. But the idea that string theory 
possesses a free parameter is, it turns out, an illusion. By changing the expectation 
value of the dilaton field, we can change the value of the coupling. This is similar 
to phenomena we observed in four-dimensional supersymmetric gauge theories. In 
situations with a great deal of supersymmetry, there will be no potential, perturba- 
tively or non-perturbatively, for this field, and the choice of coupling will correspond 
to a choice of vacuum. But in vacua in which supersymmetry is broken, we would 
expect that dynamical effects would fix the value of this and any other moduli. 
The coupling constants of the low-energy theory would then be determined fully in 
ways which, in principle, one could understand and eventually hope to calculate. In 
the next few sections, we explain this connection between coupling constants and 
fields. 


24.1.1 Couplings in closed string theories 


When we constructed vertex operators, we saw that we could include a coupling 
constant, g,, in the definition of the vertex operator. In the heterotic string, the same 
coupling enters in all vertices. This is a consequence of unitarity. At tree level, for 
example, we saw that scattering amplitudes factorize near poles of the S-matrix. 
If one introduced independent couplings for each vertex operator, the amplitudes 
would not factorize correctly. As a result, all amplitudes can be expressed in terms 
of a single parameter. In the heterotic string theory, this means that there is a 
calculable relation between the gravitational constant and the Yang—Mills coupling. 
To work out this coupling, one needs to calculate the three-point interaction for three 
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gravitons, and for three gauge bosons carefully (see the exercises at the end of the 
chapter). The results are necessarily of the form: 


Kip = ag*(20')* gin = bg*(2a')?. (24.12) 


The calculation yields a = 1/4,b = 1. 
A similar analysis, in the Type I theory, gives a relation between the open string 
and closed string couplings, and between the gauge and gravitational couplings. 
In both theories, we see that the string scale is smaller than the Planck scale: 


M, = (g.)'/*Mp. (24.13) 


This is a satisfying result. It means that if we think of M, as the cutoff on the gravity 
theory, gravitational loops are suppressed by powers of gs. 


24.1.2 The coupling is not a parameter in string theory 


So far, in all of the string theories, it appears that there is an adjustable, dimensionless 
parameter. As we said earlier, this is not really the case. The reason for this traces 
to the dilaton. Classically, in all of the string theories we have studied, the dilaton 
has no potential, so its expectation value is not fixed. In the next two short sections, 
we will demonstrate that changing the expectation value of the dilaton changes 
the effective coupling. With enough supersymmetry, there is no potential for the 
dilaton, so the question of the value of the coupling is equivalent to a choice among 
degenerate vacuum states. Without supersymmetry (or with N < 1 supersymmetry 
in four dimensions), one does expect quantum mechanical effects to generate a 
potential for the dilaton, and the value of the coupling is a dynamical question. 


24.1.3 Effective Lagrangian argument 


Perhaps the simplest way to understand the role of the dilaton is to examine the 
ten-dimensional effective action. Start with the case of the heterotic string in ten di- 
mensions. We can redefine @ = g~*x/*¢@’, eliminating g everywhere in the action. 
Note that, since x « g, this means that ¢’ ~ g!/*. Then we can do a Weyl rescaling 
which puts a common power of ¢ in front of the action (dropping the prime on @): 


Suv = ’ | Suv (24.14) 


puts a common power of ¢ out front of the action: ¢~*. This is consistent with g 
being the string loop parameter, since we have effectively g~? at the front. 

With this rescaling, it is the string scale which is fundamental. Remember that 
M; = M?/(g’)!“. By rescaling the metric, we have rescaled lengths, which were 
originally expressed in units of M,, in terms of M,. So we have a consistent picture. 
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The cutoff for the effective Lagrangian is M,. All dimensional parameters in the 
Lagrangian are of order M,, and loops are accompanied by g? ~ ¢%. 


24.1.4 World sheet coupling of the dilaton 


Just as we can couple the graviton to the world sheet, we can couple the dilaton. 
The dilaton turns out to couple to the two-dimensional curvature: 
I 2 (2) 
Lo = — | @oVh®(X)R. (24.15) 
An 

In two dimensions, however, gravity is “trivial.” If we use our usual counting rules, 
the graviton has —1 degree of freedom. So the R® term should not generate any 
sensible graviton dynamics. If we go to conformal gauge, 


Nog = e? Nap (24.16) 
the curvature is a total divergence: 
R® = 0. (24.17) 


So at most this term in the action is topological. To get some feeling for this, 
let’s evaluate the integral in the case of a sphere. We have seen that one repre- 
sentation for the sphere is provided by the space CP!. This space has one com- 
plex coordinate. It is Kahler, which means that the only non-vanishing component 


of g iS 2-3: 


822 = (0,0; K (Z, Z)) (24.18) 
where, in this case: 
K = In(1 + 2z). (24.19) 
So 
1 2 
§= (, =) . (24.20) 
From this, we can read off ¢: 
@ = 2In(1 + 2z) = —2In (1+ 07 +0;) (24.21) 


so the integral over the curvature is: 


1 
re | d’od"(—2In(1+o7+0°)) =2. (24.22) 
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Note that this is invariant under a constant Wey] rescaling; it is topological. It is 
known as the Euler character of the surface, and satisfies: 
1 
ee / do VAR® (24.23) 
4a 
and 
x =2(1 - g). (24.24) 


In this expression, x is known as the Euler character of the manifold, and g is the 
genus. For the sphere, g = 0; for the torus, g = 1, and so on for higher-genus string 
amplitudes. So string amplitudes, for constant ®, come with a factor: 


eee (24.25) 


So we can identify e® with the string coupling constant. 


Suggested reading 


The ten-dimensional effective actions are described in some detail by Green et al. 
(1987). The couplings of the dilaton in string theory are discussed in detail by 
Polchinski (1998). 


Exercise 


(1) By studying the OPEs of the appropriate vertex operators, verify Eq. (24.12). To avoid 
making this calculation too involved, you may want to isolate particular terms in the 
gravitational and Yang—Mills couplings. The required vertices in general relativity can 
be found in Sannan (1986). 
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Compactification of string theory I. Tori and orbifolds 


We don’t live in a ten-dimensional world, and certainly not in a twenty-six- 
dimensional world without fermions. But if we don’t insist on Lorentz invariance in 
all directions, there are other possible ways to construct consistent string theories. 
In this chapter we will uncover many consistent string theories in four dimen- 
sions (and in others). If anything, our problem will shortly be an embarrassment 
of riches: we will see that there are vast numbers of possible string constructions. 
The connection of these various constructions to one another is not always clear. 
Many of these can be obtained from one another by varying expectation values of 
light fields (moduli). One might imagine that others could be obtained by exciting 
massive fields as well. In general, though, this is not known, and, in any case, the 
meaning of such connections in a theory of gravity is obscure. But before explor- 
ing these deep and difficult questions, we need to acquire some experience with 
constructing strings in different dimensions. 


25.1 Compactification in field theory: the Kaluza—Klein program 


The idea that space-time might be more than four-dimensional was first put for- 
ward by Kaluza and Klein shortly after Einstein published his general theory of 
relativity. They argued that, in this case, five-dimensional general coordinate invari- 
ance would give rise to both four-dimensional general coordinate invariance and 
a U(1) gauge invariance, unifying electromagnetism and gravity. In modern lan- 
guage, they considered the possibility that space-time is five-dimensional, with the 
structure M* x S'. This is, on first exposure, a bizarre concept, but its implications 
are readily understood by considering a toy model. Take a single scalar field, ®, in 
five dimensions. Denote the coordinates of M* by x”, as usual, and that of the fifth 
dimension by y, 


O<y <27R. (25.1) 
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Because y is a periodic variable, we can expand the field ® in Fourier modes: 


n 
——,(x)e"P” Dy = 


P(x, y) = Ri (25.2) 


L Te 
Taking a simple free field Lagrangian for ® in five dimensions, the Lagrangian, 
written in terms of the Fourier modes, takes the form: 


/ d‘xdyL=— / xdy; [(a¢)? + M*¢’)] 
-feeyh dub” + (M? + p2)¢’). (25.3) 


So, from a four-dimensional perspective, this theory describes an infinite number of 
fields, with ever increasing mass. In the gravitational case, symmetry considerations 
will force M = 0. If we set M = Oin our scalar model, we obtain one massless state 
in four dimensions (n = 0), and an infinite tower — the Kaluza—Klein tower — of 
massive states. If R is very small, say R ~ My | ‘the massive states are all extremely 
heavy. For the physics of the every-day world, we can integrate out these massive 
fields, and obtain an effective Lagrangian for the massless field. The effects of the 
infinite set of massive fields — the signature of extra dimensions — will show up in 
tiny, higher-dimension operators. So, in the end, finding evidence for these extra 
dimensions is likely to be extremely difficult. 

Having understood this simple model, we can understand Kaluza and Klein’s 
theory of gravitation and electromagnetism. The five-dimensional theory has the 
Lagrangian: 


1 
= saver. (25.4) 


Now there is an infinite tower of massive states, corresponding to modes of the 
five-dimensional metric: g,,, 8,4 and g44. Our principal interest is in the massless 
states, which arise from modes which are independent of y (we will need to refine 
this identification shortly). We expect to find a four-dimensional metric tensor, g,,,, 
a field which transforms as a vector of the four-dimensional Lorentz group, g4,, 
and a scalar: g44. There are various ways we can rewrite the five-dimensional fields 
in terms of four-dimensional fields. The physics is independent of this choice, 
but clearly some choices will be better than others. The most sensitive choice 
is that of the gauge field; we would like to choose this field so that its gauge 
transformation properties are simple. The general coordinate invariance associated 
with transformations of the fifth dimension, x4 = x4 + €4(x), is: 


Sud = Sua t Ou€4(x). (25.5) 
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This looks just like the transformation of a gauge field. So we adopt the conventions: 


gual Suv = Suv: (25.6) 
Note we are defining, here, a reference metric and measuring distances relative to 
that; we can take the basic distance to be the Planck length. Substituting this Ansatz 
back in the five-dimensional action, one can proceed very straightforwardly, work- 
ing out the Christoffel symbols and from these the various components of the cur- 
vature. Gauge invariance significantly constrains the possible terms. One obtains: 


Bus = Ay; Saa(x) =e 


27R a 1 ees 
L= 3,2 V8e (R)+ 4° Firs (25.7) 


So the theory, at low energies, consists of a U(1) gauge field, the graviton, and 
a scalar. The Lagrangian is not quite in the canonical form; usually one writes 
the action for general relativity in a form where the coefficient of the Ricci scalar 
(the “Einstein term’’) is field-independent. One can achieve this by performing an 
overall rescaling of the metric, known as a Wey] rescaling, 


Suv CO Bis (25.8) 
This introduces a kinetic term for the scalar: 
1 
L= —(R + 3/2(06)’). (25.9) 
2K 


The scalar field here is particularly significant. As it corresponds to g55, giving 
it an expectation value amounts to changing the radius of the internal space. In 
the Lagrangian, there is no potential for o, so at this level, nothing determines this 
expectation value. As in our four-dimensional examples, o is said to be a modulus. 
We now show that quantum mechanical effects generate a potential for o, already 
at one loop. This potential falls to zero rapidly as the radius becomes large. If there 
is aminimum of the potential, it occurs at radii of order one, where the computation 
is certainly not reliable. 

The calculation is equivalent to a Casimir energy computation in quantum field 
theory; one can think of the system as sitting in a periodic box of size 27 R, and 
asking how the energy depends on the size of the box. We can guess the form of 
the answer before doing any calculation. Since this is a one-loop computation, the 
result is independent of the coupling. On dimensional grounds, the energy density 
is proportional to 1/R*. 

To simplify matters, we will treat the gravitational field as a scalar field. At one 
loop: 


2 
= Dae 
I =Trln (-0 + =) 5 (25.10) 
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where we can do the calculation in Euclidean space. We can obtain a more manage- 
able expression by differentiating with respect to R. The trace can be interpreted 
now as a sum over the possible momentum states in four Euclidean dimensions, in 
a box of volume VT. Replacing the sum by an integral gives an explicit factor of 
VT; the coefficient is the energy per unit volume: 


aV d*p n> 
—_—= : 25.11 
aR (27)* R? 2s p? + (n?/R?) ( ) 


This can be evaluated using the same trick one uses to compute the partition func- 
tion in finite-temperature field theory (this is described in Appendix C). One first 
converts the sum into a contour integral, by introducing a function with simple 
poles located at the integers: 


av f d*p f dz 1 I 
OR J (Qn) J mi 22 + p21 — erik 


The contour consists of one line running slightly above the real axis, and one line 


27/R. (25.12) 


running below. Now deform the contour, so that the upper line encircles the pole at 
Z = ip, and the lower line encircles the pole at z = —ip. The resulting expression 
is divergent, but we can separate off a piece independent of R and a convergent, 
R-dependent piece: 


ave a’ pp? 1 
aR RJ Qny2p e2mpR — “| 
— el. + R-independent. (25.13) 
(27)* R> 


25.1.1 Generalizations and limitations of the Kaluza—Klein program 


So far we have considered compactification of a five-dimensional theory on a cir- 
cle, but one can clearly consider compactifications of more dimensions on more 
complicated manifolds. It is possible to obtain, in this way, non-Abelian groups. So 
one might hope to understand the interactions of the Standard Model. The principal 
obstacle to such a program turns out to be obtaining chiral fermions in suitable rep- 
resentations. The existence of chiral fermions in a particular compactification is a 
topological question. As one varies the size and shape of the manifold, it is possible 
that some fields will become massless; equivalently, massless fields can become 
massive. But fields which gain mass must come in vector-like pairs. Chiral fermions 
will not simply appear or disappear as one continuously changes the parameters of 
the compactification. Spinors in higher dimensions decompose as left-right sym- 
metric pairs with respect to four dimensions, but for suitable compactification mani- 
folds, it is possible to obtain chiral fermions. However, it turns out to be impossible 
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to obtain chiral fermions in the required representations of the Standard Model 
group. We will see, though, that string theory can generate both gauge groups and 
chiral fermions upon compactification. 


25.2 Closed strings on tori 


So far we have considered compactifications of field theories in higher dimensions. 
But general higher-dimensional field theories are non-renormalizable, and must be 
viewed as low-energy limits of some other structure. The only sensible structure 
we know in higher dimensions is string theory. At the same time, if string theory 
is to have anything to do with the world around us, it must be compactified to four 
dimensions. 

It is not complicated to repeat this analysis for the case of closed strings on 
circles, or more generally on tori. Consider first compactifying one dimension, X°, 
on a circle of radius 27 R. We require that states be invariant under translations by 
22 R. This means that the momenta, as in the field theory case, are quantized, 


p= —. (25.14) 


But now there is a new feature. Because of the identification of points, the string 
fields themselves (X°) need not be strictly periodic. Instead, we now have the mode 
expansion: 


XY =x? + pc + 2mRo + 5 > : (ae? + ee), (25.15) 
n#0 
where m is an integer. The states with non-zero m are called “winding” states. 
They correspond to the possibility of a string winding around, or wrapping, the 
extra dimension. Now the mass operator, in addition to including a contribution 
( py =n? / R?, includes, as well, a contribution from the windings, m? R? (if there 
is no momentum). If R is large compared with the string scale, these states are 
very heavy. At small R, however, these states become light, while the momentum 
(Kaluza—Klein) states become heavy. This reciprocity often corresponds, as we will 
see, to a symmetry between compactification at large and at small radius. 
Let’s focus on the various superstring theories. It is convenient to break up X? 
in terms of left- and right-moving fields: 


9 


9_* n i 1 4 ~in(t—c) 
= Z+(sgtmR) Cage ae age, 


\o 


9_ * n i 1 9 ~in(t+o) 
xg= 5+ (5 - mr) ea ea be “G5 
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It is then natural to define left- and right-moving momenta: 


n 
= — — —mR. 25.18 
PL= sa tm PRS oA! ( ) 


The world-sheet fermions are untouched by this compactification. The mass oper- 
ators are essentially as before, with p replaced by py, for the left movers and pr for 
the right movers: 


1 ae | : 
Lo=5PLt+N Lo= sPrtN. (25.19) 


Suppose we compactify on a simple product of circles. The left- and right-moving 
momenta form a lattice: 


n n 
PL = —+2m!'R! Pr =o 2m! R’. (25.20) 

Let’s determine the spectrum, focusing on the light states. Consider, first, the 
heterotic string, and to simplify the formulas, we take the O(32) case. The O(32) 
symmetry is unbroken. The original ten-dimensional gauge bosons, 


|AgP) = 4 288 Wm -1/2lP), (25.21) 


now decompose into a set of four-dimensional gauge bosons, corresponding (in 
light cone gauge) to M = 2, 3, and six scalars, M = I. The graviton, scalar, and 
antisymmetric tensor field now decompose as a set of scalars, g77, By, vectors 
8yi, By;,a four-dimensional graviton, g,,,, antisymmetric tensor, b,,,, and scalar, @. 

To understand space-time fermions, work in light cone gauge, and return to 
our description of O(8) spinors. Group the y-matrices into a set associated with 
the internal six dimensions, and one associated with the (transverse) Minkowski 
directions. In other words, instead of the four creation and annihilation operators, 
a’, ai, we group these into one set of three (labelled a', where now i = 1, 2, 3, and 
b, and their conjugates). So the 8,, which previously consisted of the states 


10) altat}0)  aMa*tatta*" 0), (25.22) 
now decomposes as: 
10) alta/t\0) bla/"0) bla a*ta"|0). (25.23) 


There are four states with no bs, and four with one b. These groups have opposite 
four-dimensional helicity. They can also be classified according to their transfor- 
mation properties under O(6). O(6) is isomorphic to the group SU(4). We have 
just seen that 8, = 4 + 4. We can also see that under the SU(3) subgroup of SU(4), 
the spinor decomposes as 


8=3434+1+4+1. (25.24) 
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Consider how the gravitino in ten dimensions decomposes under O(3, 1) x 
SU(4). We see that the gravitino consists of a set of spin-3/2 particles in the 4 
of SU(4), and their antiparticles. So, from the perspective of four dimensions, 
this is a theory with N = 4 supersymmetry. This is not really surprising since the 
ten-dimensional theory was a theory with 16 supercharges, and none of these are 
touched by this reduction to four dimensions. 

Because of the high degree of SUSY, one cannot write a potential for the scalar 
fields, g77, b;;, etc.; they are exact flat directions. If we redo our Casimir energy 
calculation, we will find that, because there is a fermionic state degenerate with 
every bosonic state, there is a cancellation. 

What do these moduli correspond to? Those which arise from the diagonal 
components of the metric correspond to the fact that the radii are not fixed. There is 
a string solution for any value of the R’. The off-diagonal components are related 
to the fact that the general torus in six dimensions is not simply a product of circles; 
there can be non-trivial angles. 

The massless scalars arising from the gauge bosons, A’, are also moduli. For 
constant values of these fields, there is no associated field strength, so they carry 
zero energy. But there are non-trivial Wilson lines: 


U, = elo ast ar, (25.25) 


Because of the periodicity, these are gauge-invariant, and correspond to distinct 
physical states. These moduli are often themselves called Wilson lines. 

The periodicities of a general N-dimensional torus can be characterized in terms 
of N basis vectors, el, a=1,..., N. The theory is defined by the identifications: 


Xx’ =X! +2nn*e!. (25.26) 


The set of integers define a lattice. To determine the allowed momenta, we define 
the dual lattice, with unit vectors: é/, satisfying: 


26 = Oo5, (25.27) 


a 


In terms of these, we can write the momenta for the general torus: 

p= nie, (25.28) 
while the windings are: 

w! = me!” (25.29) 


We can break these into left-moving and right-moving parts: 


pL =(p'/2+w') ph=(p'/2-w'). (25.30) 
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The lattice of left- and right-moving momenta, (p., Pr), has some interesting fea- 
tures. Thought of as a Lorentzian lattice it is even and self-dual. The “even” refers 
to the fact that the inner product of a vector with itself: 


pi — pe =2nm, (25.31) 


is even. The self-duality means that the basis vectors of the lattice and the dual are 
the same (Eq. (25.27)). 

In bosonic or Type II theories, these are are the most general four-dimensional 
compactifications with N = 8 supersymmetry. The different possible choices of 
torus define a moduli space of such theories. These moduli correspond to varying the 
metric and antisymmetric tensor fields. In the heterotic case, the four dimensional 
theory has N = 4 supersymmetry. Additional moduli arise from Wilson lines. As 
for the simple compactification on a circle, these are essentially constant gauge 
fields. A constant gauge field is almost a pure gauge transformation (take J fixed, 
for simplicity): 


Al =iel™ 4’ ale’ — iga! gt (25.32) 
but the gauge transformation is only periodic if A’ = 1/R;. In this case, the Wil- 


son line is unity. But we can do a redefinition of all of the charged fields, which 
eliminates the A’s: 


p=. (25.33) 


With this choice, charged fields are no longer periodic, but obey boundary 
conditions: 


PX!) = e2TRIA' By, (25.34) 
This means that the momenta are shifted: 


7 ft I 
pi=—+al, (25.35) 
Ry 
Shortly, we will see how all of the different momentum lattices can be understood 
in terms of constant background fields. 


25.3 Enhanced symmetries 


For large radius, the spectrum of the toroidally compactified string theory is very 
similar to that expected from Kaluza—Klein field theories. The principal new fea- 
ture, the winding states, is not important. At smaller radius, however, these states 
introduce startling new phenomena. We focus, first, on compactification of just one 
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dimension. Examining the momenta 


mm +nR TE oe (25.36) 
es n SS SH . 
PL aR PR OR 


we see that these are symmetric under R > 1/(2R). This symmetry is often called 
“T-duality.” It means that there is not a sense in which one can take the compact- 
ification radius to be arbitrarily small; it is our first indication that there is some 
sort of fundamental length scale in the theory. 7-duality is not a feature of the 
compactification of field theory; the string windings are critical. 

What is the physical significance of this symmetry? The answer depends on 
which string theory we study. Consider the heterotic string. We first ask whether 
duality is truly a symmetry, or just a feature of the spectrum. To settle this, we can 
check that it has a well-defined action on all vertex operators. Alternatively, we note 
that there is a self-dual point: Rog = 1//2. Examining Eq. (25.19) we see that, at 
this radius, various states can become massless. These include both scalars (from 
the point of view of the non-compact dimensions) and gauge bosons: 


eS Sle al); (25.37) 


Together with the U(1) gauge boson, the spin-one particles form the adjoint of 
an SU(2). We can check this by studying the operator product expansions of the 
associated vertex operators (see the exercises at the end of this chapter). 

Now we can understand the R > 1/R symmetry. At the fixed point, the sym- 
metry is an unbroken symmetry. It transforms: 


PL—> —PL PR PR. (25.38) 
In world sheet terms, this corresponds to a change of sign of 0 XL: 


From (25.37), Xz is the third component of isospin, T; so 73 — —T3 under 
T -duality. 

This transformation is a 90° rotation about the 1 or 2 axis in the SU (2) space, i.e. 
it is a gauge transformation! This means that the large and small radii not merely 
exhibit the same physics, they are the same. It also means that, provided the theory 
makes sense, the symmetry is an exact symmetry of the theory, in perturbation 
theory and beyond. As for any gauge symmetry, any violation of the symmetry 
would signal an inconsistency. 

Returning to the self-dual point, the momentum lattice at this point can be thought 
of as a group lattice, with the p:s labeling the SU (2) charges. Much larger symmetry 
groups can be obtained by making special choices of the torus, Wilson lines and 
antisymmetric tensor fields. 
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In other string theories the symmetry has a different significance. Consider the 
Type II theories; take the case of IIA for definiteness. Then since We > —We, the 
GSO projection in the right-moving Ramond sectors is flipped. So this transforma- 
tion takes the Type IIA theory to the Type IIB theory. In other words, the ITA theory 
at large R is equivalent to the HB theory at small R. 


25.4 Strings in background fields 


The possibilities for string compactification are much richer than tori, and we will 
explore them in this and the next chapter. We can approach the problem in two 
ways, each of which is very useful. First, we can examine the low-energy effective 
field theory which describes the massless modes of the string in ten dimensions, and 
look for solutions corresponding to large internal spaces. The effective action can 
be organized into terms with more and more derivatives. The spaces must be large 
in order that this use of the low-energy effective action makes sense. Alternatively, 
we can look for more direct ways to construct classical solutions in string theory. 
Both approaches have turned out to have great value. 

We will first formulate the string problem in a more general way. We want to 
ask: how do we describe a string propagating in a background which is not flat? 
The background might be described by a metric, Gyy, but it might also include 
an antisymmetric tensor, By, a dilaton, @, and, in the case of the heterotic string, 
gauge fields. Let’s first focus on the metric. Start with the bosonic string. It is natural 
to generalize the string action: 


1 
= | day X™ 8° X nun (25.40) 
20 
to 
1 
= i; aay X™" 0" X" G(X) uw. (25.41) 
IU 


From a world sheet point of view, we have replaced a simple free field theory with 
a non-trivial, interacting field theory. We can think of the X™s as fields which 
propagate on a manifold with metric Gyy. Often this space is called the “target 
space” of the theory; the Xs then provide a mapping from two-dimensional space- 
time to this target space. 

This looks plausible, but we can give some evidence that it is the correct pre- 
scription. Suppose, in particular, we consider a metric which is nearly that of flat 
space: 


Gun = un +hun. (25.42) 
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Substitute this form in the action, and examine the path integral for the field 
theory: 


VANE ho eer ee (25.43) 


Differentiating with respect to h brings down a vertex operator for the graviton. 
In other words, the path integral for this action is the generating functional for the 
graviton S-matrix. 

This observation suggests a general treatment for backgrounds for the massless 
particles 


A 
I= fa do(g1j0yX! + €°? By jy X'0gX"). (25.44) 
0 

The corresponding path integral generates the S-matrix elements for both the gravi- 
ton and the antisymmetric tensor field. But we would like to consider configurations 
which are not close to the flat metric with vanishing By,j. We can ask: what are 
acceptable backgrounds for string propagation? To answer this question, we need to 
remember that for the free string, conformal invariance was the crucial feature to the 
consistency of the picture. It was conformal invariance which guaranteed Lorentz 
invariance and unitarity. So we need to look for interacting two-dimensional field 
theories which are conformally invariant. 


25.4.1 The beta function 


Field theories of the type we have just encountered are called non-linear sigma 
models. In 1 + 1 dimensions, these are renormalizable theories: g77, By;, etc., are 
dimensionless. A priori, however, they are general functions of the fields, and there 
are an infinite — continuously infinite — set of possible couplings. 

Physically, the statement that these theories must be conformally invariant is the 
statement that their beta functions must vanish. To get some feeling for what this 
means, let’s consider a special situation. Suppose that B;; vanishes, and that the 
metric is close to the flat space metric: 


gun = Nun + / d°k hyy(kye*™. (25.45) 
The action is then: 


1 
I=— | do (r Fr Gere So hi yet x!0X!) (25.46) 
20 ; 
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We can treat the term involving / as a perturbation. Working to second order, 
we have: 


(f dz (Apy (ke * AX (z1)49X(z1)") x [hyo ede OX Cay OXC))). 
(25.47) 


Let’s write this simply as: 


/ d*z / d?z'hyOi(z1)h2O2(z2). (25.48) 


Ultraviolet divergences will arise in this integral when z; — Zp. In this limit, we 
can use the operator product expansion, 


C12; 

O1(21)O2(z2) = —L, Oj(z2) ++ (25.49) 
ales 2 

The integral over zp is ultraviolet divergent. If we cut it off at scale A~', we have 

the correction to the world sheet Lagrangian: 


i d°z hyhye12; O; In(A). (25.50) 


There is another divergence associated with the couplings h; and h2; this comes 
from normal ordering. In the case of the graviton vertex operator, if we simply 
expand the exponential factors and contract the xs, we obtain: 


: d?zh\(x)k* In(A). (25.51) 


Requiring, then, that the beta function for the coupling 4; vanishes, gives: 
hy + hohg ci3 = 0. (25.52) 


Recall, now, that c;;, is the three-point coupling for the three fields. So this is just 
the equation of motion, to quadratic order in the fields. 

This result is general. At higher orders, one encounters divergences of two types. 
First, there are terms involving a single logarithm of the cutoff, times more powers of 
the fields. Second, there are terms involving higher powers of logarithms. The higher 
powers are, from a renormalization perspective, associated with iterations of lowest 
order divergences, and they are systematically subtracted off in computing the beta 
functions. From a space-time point of view, these correspond to the appearance 
of massless intermediate states, which must be subtracted off in constructing the 
effective action or equations of motion. 

This procedure can be used to recover Einstein’s equations. A more elegant 
and efficient approach is to apply the background field method. For a general 
gravitational background, one can view X as a fixed background, which solves the 
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two-dimensional equations of motion, and study fluctuations about it. For a suitable 
choice of coordinates, the metric is second order in the fluctuations. One can include 
in this analysis background antisymmetric tensor fields and a background dilaton. 
The antisymmetric tensor can be analyzed along the lines of our analysis of h,,,. The 
dilaton is more subtle. In our action above, we omitted one possible coupling: the 
two-dimensional curvature. The dilaton couples to the world sheet fields through: 


/ Ao PR”, (25.53) 


The full analysis leads to the equations of motion: 


/ 
Buy = 0 = a'Ryy + 20'V,V, 0 — 7 Huw (25.54) 
/ 
pS ~5V" Hous 4 a'V°® Hoy + O(a’? (25.55) 
D-26 a 
po = —— - sve + a'V,0V°o — spl Hs: (25.56) 


It is possible to extend these methods to describe quantum corrections to the equa- 
tions, at least in the case of supersymmetric compactifications. 


25.4.2 More general tori 


As a first application, we consider the heterotic string theory in the case of more 
general tori. 

For general metric and backgrounds for both the antisymmetric tensor and gauge 
fields, one obtains a somewhat more involved expression for the momenta. A partic- 
ularly elegant way to derive this is to argue that constant background fields should 
effect only slow modes of the string. In the presence of background, constant metric 
and antisymmetric tensor fields, the action is: 

1 


I= = | arao [ (g770yX1 O° X! + €°F By dy X'0gX"). (25.57) 
TU 0 


To realize the notion of slowly varying fields, one makes the Ansatz: 
X! =q'(t)+20m! (25.58) 


where the second term allows for the possibility of winding. Substituting this back 
in the action and performing the integral over o: 


1 
[= fa (Seua'a’ +2B,,q'm! -2g,,n'n). (25.59) 
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Now we can read off the canonical momenta: 
Py = g17q7 +2Byj;m’. (25.60) 


In quantum mechanics, it is the canonical momenta which act by differentiation 
on wave functions, so it is the canonical momenta which must be quantized for a 
periodic system: 


Pr =n. (25.61) 
In terms of q’, this gives: 
g' = g'!m,;—2Bin’. (25.62) 
Finally, integrating this equation and substituting back into X’: 
X! =q'+2o0m!' + t(¢// ny —2Bim’). (25.63) 
From this, we can read off the left- and right-moving momenta: 


1 
pL =m' + 58) — g'/Byxm* 


1 
ae = —m' + 5809 — go Byxm*. (25.64) 


Once again, py: Py, — PR°- Pp is an integer; the lattice, thought of as a Lorentzian 
lattice, is even and self-dual. 

Including Wilson lines is slightly more subtle, because of their asymmetric cou- 
pling between left and right movers. For small A, the modification is essentially 
what we guessed above. There is also a modification of the internal, Eg charge 
lattice. 


25.5 Bosonic formulation of the heterotic string 


We have seen that in toroidal compactifications of string theory, new unbroken gauge 
symmetries can arise at particular radii. We have also seen that a toroidal compact- 
ification can be described by a lattice. So far, in describing the heterotic string, we 
have worked in what is known as the fermionic formulation. There is an alternative 
formulation, in which the 32 left-moving fermions are replaced by 16 left-moving 
bosons. 

It is an old result that two-dimensional fermions are equivalent to bosons; more 
precisely, two real, left-moving fermions are equivalent to a single real boson, and 
vice versa. The correspondence, for a complex fermion, 4, is: 


Mz) = el? , (25.65) 
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where ¢ is a left-moving boson. The equal sign here is subtle; at finite volume, 
care is required with the zero modes, as we will see. To be convinced that this 
equivalence is plausible, consider correlation functions at infinite volume. From 
our previous analyses of two-dimensional Green functions, we have: 


1 


ae 


(ACZ)ACw)) = (ee) (25.66) 
This suggests that, say, in the case of the S$ O(32) heterotic string, we can replace the 
32 left-moving fermions by 16 left-moving bosons. Note that this means, loosely, 
that we have 26 left-moving coordinates, as in the bosonic string (but still only 10 
right-moving bosons). At finite volume (i.e. 0 < o <7), we can write the usual 
mode expansions for these fields: 


| es 
Xf = spi + 5 ee (25.67) 


Now the pis are elements of the group lattice. Modular invariance requires that the 
lattice be even and self-dual. In 16 dimensions, there are two such lattices, those of 
O(32) and Eg x Es. 

The bosonization of fermions which we have described here is useful for the right- 
moving fields as well and also for the fermions of the Type I theories. We have 
avoided discussing space-time supersymmetry in the RNS formalism because the 
fermion vertex operators and the supersymmetry generators must change the bound- 
ary conditions on two dimensional fields. But in this bosonized form, this problem 
is simpler. Once again, we have relations of the form 


Wiw~ el®, (25.68) 


The @s live on a torus, whose “momenta” describe both N and RS states. Operators 
of the form e? change NS to R states, i.e. they connect bosons to fermions. This 
connection allows construction of fermion vertex operators and the supersymmetry 
generators. 


25.6 Orbifolds 


Toroidal compactifications of string theory are simple; they involve free two- 
dimensional field theories. But they are also unrealistic. Even in the case of the 
heterotic string, they have far too much supersymmetry and their spectra are not 
chiral. There is a simple construction which reduces the amount of supersymmetry, 
yielding models with interesting gauge groups and a chiral structure. The corre- 
sponding world sheet theories are still free, so explicit computations are straight- 
forward. These constructions are also interesting in other ways. They correspond 
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Fig. 25.1. Torus that admits a Z3 symmetry, allowing an orbifold construction. 


to particular submanifolds of the moduli space of larger classes of solutions. They 
exhibit interesting features like discrete symmetries and subtle cancellations of 
four-dimensional anomalies. At low orders, it is a simple matter to work out their 
low-energy effective actions. Through a combination of world sheet and space-time 
methods, one can understand their perturbative and in some cases non-perturbative 
dynamics. 

In this chapter, we will work out one example in some detail. Other examples can 
be studied in a similar way. We will also mention some other free field constructions 
of interesting string solutions. 

We start with a toroidal compactification on a particular lattice, a product of 
three tori as shown in Fig. 25.1. It is convenient to introduce complex coordinates, 


Gas hi gar +i 2 ax fix’. (25.69) 


This lattice is invariant under a Z3 symmetry 


op (25.70) 
This can be seen by carefully examining the figure. The lattice vector (1, 0), for 
example, in the original cartesian coordinates is rotated into the lattice vector 
(—1/2,1/ /2). This is related by translation by a lattice vector to (1/2, 1/ af 2) 
Now we identify points under the symmetry, i.e. two points related by asymmetry 
transformation are considered to be the same point. The result is almost a manifold, 
but not quite. There are three points which are invariant under the symmetry. These 
are called “fixed points.” These are the points: 


(0, 0); (1/2, V3/2); (1, V3). (25.71) 


The geometry near each of these points is singular. If one parallel transports about, 
say the point at the origin, after 120°, one returns where one started. The space 
is said to have a deficit angle. It is as if there was an infinite amount of curvature 
located at each of the points. Such a space is called an “orbifold.” 
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In quantum mechanics, requiring such an identification of points under a symme- 
try means requiring that states be invariant under the quantum mechanical operator 
which implements the symmetry. Consider the various states of the original ten- 
dimensional theory. In the Type II theory, for example, in the NS—NS sector, we 
have the states, before making any identifications: 


We oP 1/210) WoW! 1/210) WW! 1/210) (25.72) 

Ve ipWir2l0) iw! 210). (25.73) 

After the identification, the first set of states are invariant; the latter are not. These 

states all have simple interpretations. The first are the four dimensional graviton, 

antisymmetric tensor and dilaton. The second are the moduli of the torus. The parts 

symmetric under i —> j correspond to the metric components in the original theory 

gj. The antisymmetric parts correspond to the corresponding components of B;;. 

The diagonal components, g;,7, are easily understood. Changing slightly the value 

of these components correspond to changing the overall radius of the ith torus. This 

does not change the symmetry properties. The off-diagonal components, g15, etc., 

correspond to deformations which mix up the three planes, but leave a lattice with 

an overall Z3 symmetry. 

To understand what happens to the supersymmetries, focus on the gravitino. It 

is convenient to work in light cone gauge, and to decompose the spinors as we did 

earlier. To determine how the spinors transform under the Z3, we need to decide 

how the state we called |0) transforms under the symmetry. Consider a rotation, 
say in the 12 plane, by 120°. The rotation generator is: 


i 
Sp = {in — ny) = atta'+1/2. (25.74) 


So the rotation of the state |0), is described by 


e620) =e 6 |0), (25.75) 
The transformations of the other states can then be read off from the transformation 
laws of the a's: 


10) > es" |0) ai|0) > e al0). (25.76) 


Now we have to be a bit more precise about the orbifold action. This is a product 
of Z3s for each of the planes. But we see that acting on fermions, the separate 
transformations are Z¢s. In order that the group action be a sensible Z3, we need 
to take, for example: 


—4Ari 


Gizes? Paser sr Zxeee. (25.77) 
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With this definition, the fermion component 0 < |0) is invariant under the orbifold 
projection. The components i < a'|0) are not. 
We can label the gravitinos: 


Vou Va Via Whe (25.78) 


After the projection, instead of eight gravitinos, as in the toroidal case, there are 
only two; we have N = 2 supersymmetry in four dimensions. 

In addition to projecting out states, we need to consider a new class of states. 
We can consider closed strings which sit at the fixed points. More precisely, in 
addition to the strict periodic boundary condition, we can consider strings which 
satisfy: 


X'o +m) =e Xi(o). (25.79) 


These boundary conditions do not permit the usual bosonic zero modes. Instead, 
we have a mode expansion: 


i_y! i i 2i(n—4)(o—t) 1 wi, 28(n—-4)(0 41) 
xX'= Xa) + 5 ss of, 1¢ 3 + a, _1e 3 ; (25.80) 
n 


The mode numbers are now fractional; the absence of a momentum term indicates 
that the strings sit at the fixed points (labeled by a). In this case, there are 27 fixed 
points. For the fermions, we again have to distinguish Ramond and Neveu-Schwarz 
sectors. In the NS sectors, the fermions have modes which differ from integers by 
multiples of 1/2 — 1/3 = 1/6: 


wi = Pe v Lege oe @) (25.81) 


with a similar expansion for w. 
We can readily work out the normal ordering constant, using the formula we 
wrote earlier (Eq. (22.30)). We have, in the NS—NS sector: 


a=6~x 7/3 x 2/3)-6x 41/6) x (5/6) —4 x 7/2) x (1/2) =0. 
(25.82) 
So the ground state is massless in the twisted sectors. Again, because of N = 2 
supersymmetry, there can be no potential for this field. So there is a modulus in each 
twisted sector. Unlike the moduli in the untwisted sector, this modulus does not 
correspond to simply changing the features of the torus which defines the orbifold. 
Instead, it represents a deformation which, from a space-time viewpoint, smooths 
out the orbifold singularity. The resulting smooth space is an example of a Calabi- 
Yau manifold, of the type we will discuss in the next chapter. 
Let’s turn to the heterotic string theory on this orbifold. We will take the same 
projector on the spatial coordinates, X', as before. As a result, there is only one 
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gravitino; the four-dimensional theory has N = 1 supersymmetry. The moduli are 
in one to one correspondence with the scalars of the NS—NS sector of the N = 
2 theory: g;;, Bj;,@. We can also make a projection on the world sheet gauge 
degrees of freedom. There are many possible choices of this gauge transformation; 
the principal restriction comes from the requirement of modular invariance. A 
particularly simple one is almost symmetrical between the left and right movers. 
In the fermionic formulation it works as follows. Take Eg x Eg for definiteness. 
Of the 16 fermions in the first Eg single out 6, and rewrite them in terms of three 
complex fermions, A’. Call the remaining ten fermions A“. Now, in the projection, 
require invariance under 


ZisvetZ pioetwi wt oeFa, (25.83) 


In the untwisted sector, this projection has no effect on the graviton or the moduli 
which we have identified previously. But consider the various gauge fields. In ten 
dimensions, these were vectors in the adjoint of the two Egs and their fermionic 
partners. The fields with space-time indices in the internal dimensions now appear as 
four-dimensional scalars. In order that they be invariant under the full projection, 
it is necessary to choose their gauge quantum numbers appropriately. In the NS 
sector for each of the E's, the invariant states include the following. 


(1) A set of fields in the adjoint of Es, Eg and an SU(3). Of these, an O(10) subgroup 
of the E¢ is manifest in the NS-NS-NS sector, as well as an O(16) subgroup of Eg. 
Correspondingly, the gauge bosons are: 


A 0? 0 210) (25.84) 
in O(10), 
AA oe We 1/2l0) (25.85) 
in O(16), and, in SU(3) x U(1): 
een ate Oe (25.86) 


Note that all of these states are invariant. The U(1) is actually one of the E'¢ generators. 
E6 has an O(10) x U(1) subgroup under which the adjoint representation, which is 
78-dimensional, decomposes as: 


78 = 450 + lo + 16-12 + 161/2. (25.87) 


The remaining E¢ gauge bosons are found in the R-NS-NS sector. The left-moving 
normal ordering constant vanishes. The ground states in this sector are spinors of O(10), 
the 16 and 16 above. The 248-dimensional representation of the second Es is filled out 
as in the uncompactified theory. 
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(2) Matter fields. These lie in the fundamental representation of E,¢, the 27 under O(10). 
The 27 decomposes as: 


27 = 1_2 + 10; + 16_1/2. (25.88) 
There are nine 10s in the untwisted sectors, corresponding to the states: 
42 10M ow! 1210) (25.89) 


Each of these is one real scalar; we can use the conjugate fields to form nine more real 
scalars, or eight complex scalars. There are nine singlets of charge —2: 


pA pW ilo) (25.90) 
The 16s come from the R-NS-NS sector. 


So we have nine 27s from the twisted sectors, and no 27s; the theory is chiral. 

Let’s turn now to the twisted sectors. In the Type II case we found moduli in each 
sector. Here we will find moduli, additional 27s, and more. We first need to compute 
the normal ordering constants. For the right movers, the calculation is exactly as 
in the Type II theory, and gives zero. For the left movers in the NS—NS sector, we 
have: 


ae 6/4 1/3 2/3 a : : 
a= =F + 6/4) x 0/3) x 2/3) - 2 x (= +2) 


24° 4 
16 10 6 
+34 ~ Gh — gx 1/9) x 6/6) 
SoH (25.91) 


where the first two terms comes from the bosons, the next two from the fermions 
in the unbroken Eg, and the last terms from the fermions in the broken Ex. So we 
can make massless states in a variety of ways. 


(1) We can have 10s of O(10): 
A 1/210) twist: (25.92) 


Note that E¢ invariance requires that this state have U(1) charge +1. 
(2) Singlet of O(10) with U(1) charge —2: 


UEP seh sel OVesas (25.93) 


Together with a set of spinorial states from the R—NS sector, this completes a 27 of Eg. 
(3) Moduli, other gauge singlets: 


or! 1342 1/610). (25.94) 


If we contract the i and j index, we find the analog of the twisted sector modulus we 
had in the Type II theory. The other states represent additional singlets. 
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All together, then, we have found 9 + 27 = 36 copies of the 27 of E.¢, and 36 
moduli. Each 27 comfortably accommodates a generation of the standard model, 
plus an additional vector-like set of fields. So, while this example is hardly realistic, 
it is interesting: it predicts a particular number of Standard Model generations, 
plus additional fields. Whether variants of these ideas can lead to something more 
realistic is an important question, which we will postpone for the time being. 


25.6.1 Discrete symmetries 


One of the unappealing features of supersymmetric models as theories of nature is 
the need to postulate discrete symmetries in order to have a sensible phenomenology. 
This seems rather ad hoc. One of the features of the orbifold construction we just 
described is that a variety of discrete symmetries appear naturally. This phenomenon 
is common, as we will see, in string constructions. Here it is particularly easy to 
exhibit the symmetries. 

We have, for simplicity, considered a particular form for the torus — a particular 
point in the moduli space, at which the six-dimensional torus is a product of three 
two-dimensional tori. But at this point (really a surface), there is a large symmetry. 
First, there is a separate Z3 symmetry for each plane. (You can check that each plane 
in fact admits a Z, symmetry.) Because of the orbifold projection, one of these acts 
trivially on all states, but two are non-trivial. If we take the size of each of the three 
two-dimensional tori to be the same, we also have a permutation symmetry, $3, 
among the tori. 

The Z3s are R symmetries. We have already seen that the spinor with index 0 
rotates by a phase, e © ,under the symmetry. By definition, this is an R transforma- 
tion. This has significant consequences for the low-energy theory, greatly restricting 
the form of the superpotential. 

As an example of the far-reaching consequences of such symmetries, one can 
show that there are exact flat directions involving the matter fields. Consider the 
untwisted moduli. One can give expectation values to the O(10) 10 and 1 in one 
multiplet in a way which respects supersymmetry. Specifically, consider the field, 
o, corresponding to 


@ = 849011210) (25.95) 


and the corresponding singlet. Both of these are neutral under the rotation in the 
second plane. So one can not construct any superpotential term involving ¢ alone. 
One can give an expectation value to the singlet and to the 10 so as to cancel the D 
terms for E¢. The main danger, then, is a superpotential term of the form: 


W=v¢" (25.96) 
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with W some other 27. This is E¢ invariant (in terms of O(10) representations, it 
involves a product of a singlet and two 10s). But no such term is allowed by the 
discrete symmetries. 

So this simple argument shows that the moduli space is even larger than we 
might have thought. Such symmetries, as they forbid not only certain dimension- 
four but also dimension-five operators, might also be important to understanding 
the problem of proton stability and other important phenomenological issues. 

The model possesses other symmetries as well. There is Z3 symmetry under 
which the twisted sector states transform but the untwisted sector states do not. We 
will not derive this here, but itis plausible, and can be shown readily if one constructs 
the vertex operators for the twisted states. Many of the discrete symmetries of the 
model are subgroups of the Lorentz symmetry of the original higher-dimensional 
theory. As such, they can probably be thought of as gauge symmetries. This is less 
obvious for other symmetries, but it is generally believed that the discrete sym- 
metries of string theory all have this character. Searches for anomalies in discrete 
symmetries, for example, have yielded no examples. 

One could ask: why would nature choose a point in the moduli space of some 
string theory at which there is an unbroken discrete symmetry? At the moment, our 
understanding of how to connect string theory to nature is not good enough to give 
a definite answer to this question, but, at the very least, such points are necessarily 
stationary points of the effective potential for the moduli; at the symmetric point, 
the symmetry forbids linear terms in the action for the charged moduli. 


25.6.2 Modular invariance, interactions in orbifold constructions 


As in our original string theory constructions, there seems much which is arbitrary 
in the choices we made above. We also did not spell out what are the appropriate 
GSO projectors. As for the simple ten-dimensional constructions, the possible GSO 
projections are constrained by modular invariance. We will leave for the exercises 
checking some particular cases, but the basic result is easy to state. One can project 
by any transformation, provided that it has sensible action on fermions and on spinor 
representations of the gauge group, and provided that one has “level matching” in 
all of the twisted sectors. This means that one must be able to construct an infinite 
tower of states in each sector. To understand the significance of this statement, 
consider a different choice of group action than that we considered above. Instead 
of twisting by (1/3, 1/3, —2/3), project by (1/3, —1/3, 0). In this case, for example, 
in the NS—NS-NS sector, the left-moving normal ordering constant is —13/18. As 
aresult, one cannot construct any states in the twisted sector which satisfy the level 
matching condition. 


25.7 Effective actions in four dimensions for orbifold models 395 


There are other constructions of compactifications with N = 1 supersymmetry 
based on free fields. These include models based purely on free fermions. These 
models are believed equivalent to orbifold models in which one mods asymmetri- 
cally on the left- and right-moving fields. The latter, “asymmetric orbifold,’ models 
are interesting in that they potentially have very few moduli. In order to have sen- 
sible, unbroken discrete symmetries acting on the left and right, the original lattice 
typically must sit at a self-dual point. So many moduli are fixed — they are projected 
out by the orbifold transformation. It is not difficult, in this way, to construct mod- 
els where there are no moduli neutral under space-time symmetries, except for the 
dilaton. 


25.7 Effective actions in four dimensions for orbifold models 


While string theory provides a very explicit set of computational rules, at least for 
low orders of perturbation theory, these rules are complicated and rather cumber- 
some. Moreover, except in some special circumstances, we lack a non-perturbative 
formulation of the theory. Effective field theory methods have proven extremely 
useful in understanding the dynamics of string theory, both perturbative and non- 
perturbative. In this section, we will work out the effective action for the orbifold 
models introduced above. More precisely, we work out the Lagrangian for a subset 
of the fields, up to and including terms with two derivatives. Many of the features 
of these Lagrangians will be relevant to the more intricate Calabi- Yau compactifi- 
cations we will encounter shortly. 

In principle, to calculate the effective action, we should calculate the string S- 
matrix, and write an action for the massless fields which yields the same scattering 
amplitudes. Alternatively, we can calculate the equations of motion from the beta 
function and look for an action which reproduces these. But for low order terms 
in the derivative (a@’) expansion, for the fields in the untwisted sector, there is a 
simpler procedure. We know the form of the ten-dimensional effective action; we 
can simply truncate the theory to four dimensions. To do this, we start by setting 
all of the charged fields to zero (this includes the gauge fields). We also work at a 
point with a large discrete symmetry: Zs /Z3 x S3. We set all of the fields which 
transform under these symmetries to zero. This includes all of the moduli, except 
the overall size of the torus and its superpartners. We then write the metric as: 


ene") en) Serr 78 (25.97) 


With this parameterization, we are describing the size of the space with respect to 
a reference metric. We make a similar ansatz for the antisymmetric tensor: 


bjj(x") = —bji(x") = bo (x"))8;;- (25.98) 
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We must keep also the four-dimensional metric components, g,,, the scalar field 
@, and the antisymmetric tensor, B,,,. Take all of them to be functions of x, the 
uncompactified coordinates, only. Substituting these fields in the ten-dimensional 
Lagrangian, Eq. (24.8), the integral over the six internal coordinates is easy, since 
all fields are independent of the coordinates. One simply obtains e*?“) from the 
,/g factor. This is just the volume of the internal space, if o is constant. There are 
additional factors of e~° coming from the factors of the inverse metric: one from 
the four-dimensional pieces of the Ricci curvature; one from the kinetic term for 
@, and three from the H/,,, terms. The ten-dimensional curvature term also gives 
derivative terms in o. After a short computation, we obtain 


1 9 
f= aia — 3e°° a,00"o — ec I.Pd" b/g? 
9 3 
- 50 b a,ba"b - go OH Hyp. (25.99) 
It is customary to rescale the metric so that the Einstein term has the standard form: 


aN a ie (25.100) 


After this Weyl rescaling, the action becomes: 


pvp" 


1 9 3 3 
— __ Rp _ Be im 2 - 4-20 4-3/2 2 2 4-3/2 60 772 
L£= 5k 30,00"0 160? P0.b)/d 7° 0) 0b 4? eH 
(25.101) 


It should be possible to cast this Lagrangian as a standard four-dimensional, N = 1 
supergravity Lagrangian, with a particular Kahler potential. Having thrown away 
all but a few moduli, there is no superpotential. To determine the Kahler potential, 
we first note that, in four dimensions, an antisymmetric tensor field is equivalent to 
a scalar. This follows from counting degrees of freedom; with our usual rules, an 
antisymmetric tensor in four dimensions has only one degree of freedom. To make 
this explicit, one performs a “duality transformation” (the word is starting to seem 
a bit overused) 


Oe" Fy pe) Seni aa): (25.102) 


The field a is often called the “model-independent axion,” because it couples like 
an axion and its features do not depend on the details of the compactification. Then 
we define two chiral superfields, whose scalar components are: 


S =e 94 + 3iV2a (25.103) 
and 


T = e°g?!* —iv2b. (25.104) 


25.7 Effective actions in four dimensions for orbifold models 397 


Choosing the Kahler potential: 
K = —1In(S + S*) — 31n(T + T*) (25.105) 


reproduces all of the terms in Eq. (25.101). The reader may want to check all of 
the terms in this equation carefully, but at the least it is good to make sure one 
understands how the o and @ dependences are reproduced. 

Let’s now return to the ten-dimensional gauge field terms, Eq. (24.9). This will 
allow us to include the matter fields as well as the gauge fields. Rather than consider 
the full set of fields, we can restrict ourselves to the set which is invariant under 
each of the separate Z3s, combined with three separate Z3s in the gauge group 
(al => es al ). This leaves us with three complex scalars, C’, corresponding to the 
states 


Car lO) (25.106) 


(i is not summed). From the point of view of ten dimensions, these are Ai We 
also need to include the four-dimensional gauge fields, A . In this way we obtain 
the additional terms, after the Weyl rescaling: 


1 = ; 
Ds ce ae ae —3e°°¢ 74 D,C™ DYC! +--+. (25.107) 


This can still be put in the standard supergravity form. First, we need to remember 
that in the duality transformation, H,,,, now includes the Chern—Simons terms. Then 
it is necessary to modify the definition of T to include a contribution from the C 
fields: 


T = e°¢°* —i/2b4+ C*C! (25.108) 
and to modify the Kahler potential: 
K = —In(S + S*) — 31n(T + T* — C*C). (25.109) 


There is also a coupling of the field S to the gauge fields: 
1 
Ly= —75We. (25.110) 


This includes a coupling of @ and o to F a already apparent in Eq. (24.9). The 
aF F coupling arises from the Chern—Simons term in Eq. (24.10). Recall that 
Ap = Ay Brp| — Oprp- (25.111) 


So f d*x H*, using the definition of a and integrating by parts, gives an aF F 
coupling. Finally, there is a superpotential cubic in the C fields. 
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25.7.1 Couplings and scales 


It is worth pausing to note the connections between the couplings and scales in dif- 
ferent dimensions. We’ ll focus first on the heterotic string. We see from Eq. (25.110) 
that S determines the gauge coupling, S = 1/g”. This is as we would naively ex- 
pect. The ten-dimensional gauge coupling is essentially 1/g?; when we reduce to 
four dimensions, the four-dimensional gauge fields correspond to modes which are 
constant on the internal manifold, so 


1 


f 
5 = —=VM. (25.112) 
&4 &s 


In terms of the fields we defined above, V = e?”. 

These simple formulas pose a serious problem for the application of weakly 
coupled heterotic string phenomenology. If we simply identify S with the four- 
dimensional coupling, then the string coupling satisfies: 


g. = 8iV ME. (25.113) 


So we see that large volume, the limit in which an a’ expansion is valid, conflicts 
with small g, if g4 is fixed. We can also write a relation between the string scale 
and the Planck scale in four dimensions: 


M, = MSV8,”. (25.114) 


Solving for M, and substituting in the previous expression, gives an expression for 
gs which is incompatible with weak coupling, if we assume that V = Mocr- 

Later, we will sharpen this strong coupling problem, and consider possible 
solutions. 


25.8 Non-supersymmetric compactifications 


So far, we have considered supersymmetric compactifications. This is not necessary, 
but we will see that non-supersymmetric compactifications raise new conceptual 
and technical problems. 

Perhaps the simplest non-supersymmetric compactification is Scherk—Schwarz 
compactification. Here one compactifies the theory (this can be Type I, Type U, or 
heterotic) on a torus. In one of the directions, say the ninth direction, one imposes the 
requirement that bosons should obey periodic boundary conditions, and fermions 
anti-periodic ones. One can describe this by taking the radius of the extra dimension 
to be 2 x 27 R, and performing a projection by 


PS(aiy ee ee (25.115) 
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This projection eliminates, for example, the massless gravitinos; there is no super- 
symmetry, and there is no Bose—Fermi degeneracy in the spectrum. Indeed, in the 
simplest version, there are no massless fermions at all. 

As a result, the usual Fermi—Bose cancellation of supersymmetry does not take 
place, and at one loop, there is a non-zero vacuum energy. More precisely, there 
is a potential for the classical modulus R. The calculation of this potential is just 
the Casimir calculation we encountered earlier. Only the massless ten-dimensional 
fields contribute; the massive string states give effects which are exponentially 
suppressed for large R. To see this, one can return to our earlier calculation with 
a massive state (one of the oscillator excitations of the string). Replacing the sum 
over integers by an integral in the complex plane and deforming the contour, as 
before, yields a term exponentially small in the mass. The detailed results depend 
on the particular model, but typically the potential is negative and goes to zero at 
large R. In other words, at one loop, the dynamics tends to drive the system to small 
R. It is not well understood how to study the system beyond one loop. 

One can obtain non-supersymmetric theories in four dimensions in many other 
ways. For example, one can compactify the non-supersymmetric O(16) x O(16) 
heterotic theory ona torus (it can also, in some cases, be compactified on the Calabi— 
Yau spaces discussed in the next chapter). Again, one obtains a one loop potential 
for the moduli, and the theories are difficult to interpret at the quantum level. 
Compactifications of the Scherk—Schwarz theories and the non-supersymmetric 
theories are often related by 7 -dualities. 


Suggested reading 


An introduction to Kaluza—Klein theory prior to the development of string theory 
is provided in the text Modern Kaluza—Klein Theories by Appelquist et al. (1985). 
More thorough discussions of aspects of string compactification are provided by 
the texts of Green ef al. (1987) and Polchinski (1998). Many of the original papers, 
particularly the orbifold papers, are highly readable; see, for example, Dixon et al. 
(1986). There are many topics here we have only touched on in this chapter. We gave 
an argument that vanishing of the beta function of the two-dimensional o model is 
equivalent to the equations of motion in space-time, but readers may wish to work 
through the background field analysis which leads to Einstein’s equations. This 
is described in Polchinski’s book and elsewhere. The bosonic formulation of the 
heterotic string is also well described there, but the original papers are quite read- 
able (Gross et al. 1985, 1986). Bosonization and space-time supersymmetry in the 
RNS formulation are thoroughly discussed by Polchinski; a clear, and rather brief, 
introduction, is provided by Peskin’s 1996 TASI lectures (Peskin, 1997). The non- 
supersymmetric compactification described here was introduced by Rohm (1984). 
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Exercises 


(1) Derive the gauge terms in the Lagrangian of Eq. (25.7). You can do this by taking the 
metric to be flat. 

(2) Derive the scalar kinetic terms of Eq. (25.8). You can do this by taking the four- 
dimensional metric, at first, to be flat, and allowing only o to be a function of x. 

(3) Verify, by studying the OPEs of the vertex operators for the different massless fields, that 
the enhanced symmetry of the bosonic string at the point R = 1//2 is SU(2) x SU(2). 
Explain why, in the heterotic string, the symmetry is only SU(2). What is the symmetry 
in the ITA theory? 

(4) For the orbifold model, work out the spectrum in the untwisted sectors in greater detail, 
paying particular attention to spinorial representations of the O groups, and to the space- 
time spinors. In particular, make sure you are clear that the 27s are chiral, i.e. all the 
states in 27s have one four-dimensional chirality, all of those in 27 have the opposite 
chirality. 

(5) Derive the terms in Eq. (25.99) involving do?. 

(6) Verify that the Kahler potential of Eq. (25.109) properly reproduces the kinetic terms 
of the matter fields. 
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Compactification of string theory I. 
Calabi—Yau compactifications 


So far, we have focussed on rather simple models, involving toroidal compactifi- 
cations and their orbifold generalizations. But while by far the simplest, these turn 
out to be only a tiny subset of the possible manifolds on which to compactify string 
theories. A particularly interesting and rich set of geometries is provided by the 
Calabi—Yau manifolds. These are manifolds which are Ricci flat, Ryy = 0. Their 
interest arises in large part because these compactifications can preserve some sub- 
set of the full ten-dimensional supersymmetry. This is significant if one believes that 
low-energy supersymmetry has something to do with nature. It is also important at 
a purely theoretical level, since, as usual, supersymmetry provides a great deal of 
control over any analysis; at the same time, there is less supersymmetry than in the 
toroidal case, so a richer set of phenomena are possible. 

This chapter is intended to provide an introduction to this subject. In the first 
section, we will provide some mathematical preliminaries. Unlike the toroidal or 
orbifold compactifications, it is not possible, in most instances, to provide explicit 
formulas for the underlying metric on the manifold and other quantities of interest. 
The six-dimensional Calabi—Yau spaces, for example, have no continuous isome- 
tries (symmetries), so at best one can construct the metrics by numerical methods. 
Butitturns out to be possible to extract much important information without detailed 
knowledge of the metric from topological considerations. The machinery required 
to define these spaces and to extract at least some of this information includes al- 
gebraic geometry and cohomology theory, subjects not part of the training of most 
physicists. The following mathematical interlude provides a brief introduction the 
the necessary mathematics. There is much more in the suggested reading. 


26.1 Mathematical preliminaries 


Two notions are very useful for understanding Calabi—Yau spaces: differential forms 
and vector bundles. Differential forms have already appeared implicitly in our 
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discussion of IIA and IIB string theory. Start with an antisymmetric tensor field, 


A . Suppose that there is a gauge invariance: 


1 
Ai,..i, > Aiy..i, + —19ii Aisin — 0;, Nijiz.i, Fe? (—1)'8;, Aiy.i-sissiin fs 


(26.1) 


ipigetin 


where A is antisymmetric in all of its indices. We can write a shorthand for this: 
dbA=dA, (26.2) 


where dA is the “exterior derivative.” Acting on an antisymmetric tensor of rank 
p, the exterior derivative produces a rank p + 1 antisymmetric tensor, dH: 


pail 


We can think of this object more abstractly as follows. Antisymmetric tensors 


dAi,..i,,; = ( : ) [peti ag 0s Eta an Os -| ; (26.3) 


with p indices we will call p-forms. A “basis” for the p-forms is provided by the 
antisymmetrized products of differentials: 


dx! Adx® A+++ Adx'?, (26.4) 
We can then write: 


dx At aRGE?, (26.5) 


The product of two forms, A, B is known as the wedge product, A A B. If A is an 
n-form and B an m-form, 


'm! 
(AA B)incigin = ate Bisgscinan + (—1)? permutations (26.6) 
n mM): 
or, more compactly: 
1 
. . id im Gam Bins gictin A a A ager, (26.7) 
n mM): 


In this language, the exterior derivative can be written as d A H or simply dH, 
where d is thought of as a one form with components d; = 0;. 

It is important to practise with this notation, and some exercises are provided at 
the end of the chapter. One should check that 


dH =0. (26.8) 


It is instructive to write electrodynamics in the language of forms. One should 
verify that the field strength tensor is a two form, which can be written as 


F =dA. (26.9) 
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The homogeneous Maxwell’s equations (the Bianchi identities for the field strength) 
follow from d? = 0, 


dF =0. (26.10) 


Apart from multiplication and differentiation, there is another important opera- 
tion denoted by , and called the Hodge star. In d dimensions, this takes a p-form 
toad — p-form: 


1 id—p ay? 
(#ADis inp se pl Steady Pac ptt (26.11) 


A particularly interesting object is *d. For example, «xd Ad is a d-form. But the 
components of a d-form are necessarily proportional to €;,_;,. With a little work, 
one can show that: 


«(xd \ d) = 8°. (26.12) 
Using the * operation, we can write the action for a p-form field: 
1 
S= aon! AF (26.13) 
2(p + 1)! 


with F = dA. This is clearly gauge-invariant. It is easy to check that this reproduces 
the standard action for electrodynamics. 

For physics, we are particularly interested in zero modes of A, i.e. field config- 
urations that satisfy dA = 0, but which are not simply gauge transformations, i.e. 
which cannot everywhere be written as 


A=daA. (26.14) 


A simple example of what is at issue is provided by a gauge field on a circle, 
0 < y < 27 R. The one-form gauge field, 


Ay=dA A=cy (26.15) 


is nota pure gauge transformation unlessc = n/R. Inelectrodynamics, for example, 
this corresponds to the fact that the Wilson line, 


U = ei boars (26.16) 


is gauge-invariant, and non-trivial, again, unless c = n/R. 
This suggests that we want to consider closed p-forms, a, which satisfy 


da =0 (26.17) 
but that we are not interested in exact forms, 


a = dp. (26.18) 
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More generally, we want to define an equivalence class, known as the cohomology 
class of a. We will view a and a’ as equivalent if 


a’ =a+t+dB, (26.19) 


where f is well defined everywhere on the manifold. 

In general, for field configurations on a manifold M, the number of linearly inde- 
pendent zero modes is known as the Betti number, b,,. This number is related to the 
number of (basis) p-dimensional submanifolds which are not boundaries of p + 1- 
dimensional surfaces. We won’t prove this, but we will at least make it plausible. 
Consider integration of a p-form, a, over a p-dimensional submanifold, &: 


/ a, 4, 2D, (26.20) 
x 


By Stokes’ theorem, the integral of the exterior derivative of a p — 1-form, 8, over 
&, is related to the integral of 6 over the boundary of X: 


[4 - i B. (26.21) 


If & is compact, it has no boundary, so the integral of dB = 0. 
Two p-forms are in the same cohomology class if 


/ (a—a') = i: dpB=| B=0. (26.22) 
x >») CD») 


Note, as before, it is important in this expression that 6 is defined throughout the 
manifold. 

If we consider the structure of a massless chiral multiplet, we note that there 
are two scalars and a chiral fermion. In compactifications preserving N = 1 super- 
symmetry, modes of antisymmetric tensor fields which are annihilated by d will 
correspond to massless scalars; supersymmetry guarantees that the other elements 
of the multiplet are also present. The suggested readings at the end of the chapter 
contain more detailed discussion of these issues, but it is not too hard to understand 
how the various states in terms of the forms annihilated by d. The other massless 
scalar arises because one can also choose the form so the Laplacian vanishes. The 
Dirac operator is closely related to differential forms on manifolds. This can be 
shown using the creation—annihilation operator construction of the Dirac matrices 
we have used in our discussion of orthogonal groups. One can exhibit in this way 
the required pairing. 

With this machinery, we can define an important set of topological invariants 
of manifolds: characteristic classes. Consider a gauge field, F, F = dA. Note that 
F is closed: dF = 0. Also, F is said to be an element of H,(M, R), the second 
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cohomology group of the manifold M with real coefficients. The cohomology class 
of such two forms is known as the first Chern class. 

If the manifold is topologically non-trivial, then if we consider a gauge field, 
it may not be possible to describe the field everywhere by a single, non-singular 
potential. This problem is familiar to us from the case of the Dirac monopole. 
Instead, in different regions, a and f, we have to use different potentials. Aq), A,g). 
In regions where a and f overlap (transition regions), Aj) and A,g) will be gauge 
transforms of one another, 


Ata) = Aw) + Pap): (26.23) 
Another set of gauge fields are said to be in the same topological class if 
Ata) = Aig) + Pap) (26.24) 


with the same transition function, ¢. Now since the functions A and A are not 
uniquely defined everywhere, F = dA and F = dA are not in the trivial cohomol- 
ogy class, in general. On the other hand, F — F is, since the difference, A — A = B 
is well defined. So F — F = dB, and F and F are in the same cohomology class. 
So the cohomology class of F’,, the first Chern class, is a topological invariant. 

There is a theorem that if the first Chern class is non-zero, one can always find 
a two-dimensional surface, &, with the property: 


I(x) = ~| F+A0. (26.25) 
= 


Note that this is a kind of magnetic flux. By Dirac’s argument, /() is an in- 
teger. The first Chern class plays an important role in the theory of Calabi—Yau 
spaces. 

These ideas can be generalized to complex spaces. Here we define, as we did 
for the orbifold, complex coordinates, z; and z;. We then define a (p, g)-form 
w to be an object with p z;-type indices and q Z;-type indices. Note that y is 
totally antisymmetric in both types of indices. We can define two types of exterior 
derivatives, 0 and @, in the obvious way: 


1 


OWa, p10) .dg = pet 


Da; Way...apsath..d, +(—1)” permutations. (26.26) 


Note that a2 = 0; @ is defined similarly. In terms of these definitions, 
d=d+0. (26.27) 


These are known as the Dolbeault operators. We can then consider differential forms 
annihilated by these operators. The numbers of independent forms annihilated by 
the 0 and 0 operators are known as the Hodge numbers, h?. Then, for example, 
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one has the Hodge decomposition: 


b, = > hP-, (26.28) 
ptq=n 


Again, is is possible to choose these forms so that they are annihilated by the 
Laplacian. 


26.2 Calabi—-Yau spaces: constructions 


We have already constructed a rather rich set of four-dimensional string theories. 
But they are only a small subset of what appears to be a vast set of possibilities. 
We saw, for example, that the orbifold compactifications give rise to moduli which 
describe states which are not orbifolds. A rich set of compactifications of string 
theory, of which the orbifolds we studied in the last chapter are special cases, are 
provided by the Calabi—Yau spaces. In this section, we introduce these. 

Our strategy to construct solutions is to look for solutions of the ten-dimensional 
field equations. One can ask: why is this sensible? There are two answers. First, if we 
consider spaces in which the massless, ten-dimensional fields are slowly varying, 
it should be appropriate to integrate out the massive string modes and study the 
low-energy equations. A more serious question is: why can we simply look at the 
low-order equations? Even at the classical level, integrating out the massive states 
will lead to terms with arbitrary numbers of derivatives. This question is far more 
serious. If we solve the equations, say, involving two derivatives, we can try to 
find solutions of the terms up to four derivatives perturbatively. To do this, we 
expand the fields in modes of the lowest-order theory (e.g. eigenfunctions of the 
Laplace operator on the complex space). These are precisely the Kaluza—Klein 
modes. Calling these ¢,, plugging our lowest-order solution into the next order 
terms, we will obtain equations of the form: 


a’ 


= ln (26.29) 


(V? + m)on 
For m, # 0, i.e. for the massive Kaluza—Klein modes, we simply obtain a small 
shift. But the massless modes are problematic. In the case of Calabi-Yau compact- 
ifications, it is supersymmetry which will come to our rescue. We will see that, for 
the massless modes, the tadpoles (I’,,s) vanish. 

We begin with the Type II theory. Rather than examine the equations of motion, 
we look at the supersymmetry variations. In flat space four-dimensional theories, 
we are familiar with the idea that we find minima of the potential by setting the 
auxiliary fields to zero. We can phrase this in a different, seemingly more obscure 
way. We can find static solutions of the classical equations by requiring that the 
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supersymmetry variations of all of the fields vanish. That is, we require: 
6by=eF=0 bA=eED=0. (26.30) 


We will try the same strategy. In Chapter 17, we introduced the essential elements 
required to understand spinors in a gravitational background (the reader may want 
to reread Section 17.6). To make things simple, we will look for solutions where 
the antisymmetric tensor vanishes and the dilaton is constant, so only the metric 
is spatially varying. Then the condition that there be a conserved supersymmetry 
becomes: 


So 7 is covariantly constant. This means that under parallel transport around any 
closed curve, n returns to itself. As in gauge theories, the effect of parallel transport 
can be described in terms of Wilson lines, where now the Wilson line is written in 
terms of the spin connection, w: 


U = Pel foe, (26.32) 


The fact that 7 is unchanged under any such transformation greatly restricts the 
form of w. To see how this works, consider that in the ten-dimensional Lorentz 
group, there is an O(6) which acts on the compactified coordinates, as well as 
the four-dimensional Lorentz group acting on the Minkowski coordinates. The 
16-component spinor in ten dimensions decomposes under these groups as 


n = (4, 2) + (4, 2°). (26.33) 


By local Lorentz transformations, we can take the (4, 2) to have the form (suppress- 
ing the four-dimensional spinor index): 


(26.34) 


In order that this be invariant, we require that the spin connection lie in an SU(3) 
subgroup of O(6). The space is said to be a space of SU(3) holonomy. 

In general, w is an O(6) matrix. Restricting to SU(3) is a strong constraint. 
Already U(3) holonomy requires that the manifold be complex. We encountered 
this already in the orbifold case, where we introduced three complex coordinates 
and their conjugates. There is no unique way to introduce the complex coordinates. 
The continuous set of choices will lead to a set of moduli of our solutions, known 
as the “complex structure moduli.” In addition, a manifold of U(3) holonomy is 
Kahler. This means that the metric can be derived from a function K(x‘, x‘), the 
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Kahler potential, through: 
8ij = 0;0;K. (26.35) 


While proving that a manifold of U(3) holonomy must be Kahler is challenging, it 
is not hard to check that a Kahler manifold has U(3) holonomy. Some aspects of 
these manifolds are discussed in the exercises. 

The Christoffel symbols and curvature for a Kahler manifold can be written in 
quite compact forms. (Verification of these formulas is left for the exercises.) The 
components of the affine connection (Christoffel symbols) are given by 


Pe. = 8 Ogea 14, = 94 d5dbgca. (26.36) 


As a result, the non-zero components of the Riemann tensor are: 


Rig = ae; (26.37) 
and the Ricci tensor is 
Rie = —0-1%,. (26.38) 
Using 
ré, = 0, Indet g, (26.39) 
this can be further simplified: 
Ric = —95,0- Indet g. (26.40) 


Note that our result, Eq. (24.17), for the curvature of a two-dimensional Riemann 
surface is a special case of this. 

The requirement that the metric have SU(3) holonomy has a dramatic conse- 
quence for the curvature: the Ricci tensor vanishes. This follows from our discussion 
of the spin connection as a gauge field for local Lorentz transformations. On a six 
(real)-dimensional Kahler manifold, we have seen that the spin connection is not an 
O(6) field, but rather a U(3) field (in four dimensions, it is a U(2) field, etc.). The 
U(1) part of the Riemann tensor is the trace over the Lorentz indices — the group in- 
dices, thinking of the Riemann tensor as a non-Abelian field strength. But this object 
is the Ricci tensor. So SU(3) holonomy requires that the Ricci tensor itself vanish 
everywhere on the manifold. For such a configuration, the lowest-order Einstein 
equation is automatically satisfied, R;j = 0. The question which we would like to 
address is, given a Kahler manifold, is it possible to deform the Kahler potential 
so that the Ricci tensor vanishes. Clearly a necessary condition for this is that the 
integral, 


1 
c= = | mR (26.41) 
20 
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vanish. This quantity is the first Chern class, the topological invariant which we 
discussed earlier. It was Calabi who conjectured that the vanishing of the first Chern 
class for a manifold was a necessary and sufficient condition that the manifold admit 
a unique metric of SU(3) holonomy. Yau later proved this conjecture. The spaces 
constructed in this way are the famous Calabi—Yau spaces. In general, while one 
can prove that such metrics exist, actually constructing them is a difficult numer- 
ical problem. Fortunately, many properties relevant to the low-energy behavior of 
string theory on these manifolds can be obtained from more limited, topological 
information. 

It is worthwhile comparing this with our orbifold constructions. The orbifolds 
are everywhere flat. But the existence of a deficit angle associated with the fixed 
points means that there is actually a 6-function curvature; this gives precisely the 
holonomy of these manifolds. If we decompose the spinors, as before, then as 
we transport them about the fixed points, the i components pick up a phase, eu, 
while the 0 components are invariant. Correspondingly, we found one unbroken 
supersymmetry. 

When we discuss the heterotic theory on a Calabi—Yau space, we will have to 
choose values for the gauge fields as well. It will not be possible to simply set the 
gauge fields to zero. From the point of view of four dimensions, gauge fields with 
indices in the extra dimensions are like scalars, so this will result in breaking of 
some or all of the gauge symmetry. As we will see in Section 26.6.1, there are many 
possible choices for these fields, with distinct consequences for the structure of the 
low-energy theory. In an interesting subclass, some features of the heterotic theory 
are closely related to those of Type Il on Calabi—Yau spaces. 


26.3 The spectrum of Calabi-Yau compactifications 


In both the Type II and heterotic cases, many features of the low-energy spec- 
trum follow from general topological features of the manifold and do not depend 
on details of the metric. In the heterotic case, the number of generations (mi- 
nus the number of anti-generations) is a topological invariant. Suppose that we 
have some number of generations for some choice of metric. If we now make 
smooth, continuous changes in the metric, the massless spectrum can change 
as generations and anti-generations pair to gain mass or become massless. In 
other words, a mass term in an effective action can pass through zero, but the 
net number of generations cannot change. In some cases, other features of the 
spectrum are similarly invariant. So while it is difficult to write down explicit 
metrics for manifolds of SU(3) holonomy, it is possible to determine many im- 
portant features of the low-energy theory from basic topological features of the 
manifold. 
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In the Type II theory, the numbers of hypermultiplets and vector multiplets are 
separately topological. They do not pair up as one moves about on the moduli 
space; the N = 2 supersymmetry insures that if a field is massless at one point in 
the moduli space, it is massless at all points. Even more dramatic is the fact that 
massless states found in the lowest order of the a’ expansion are exactly massless. 
So it is enough to study the lowest-order supergravity equations of motion to count 
massless particles. 

The important non-zero Hodge numbers are h”! and h!:!. In the IIA theory, 
there are h':! vector multiplets and h?:! hyper-multiplets. In the IIB theory, this 
is reversed. In the heterotic case, the (2, 1) forms will correspond, effectively to 
generations, the (1, 1)-forms to anti-generations. 

The counting of massless fields is not difficult to understand. Since we have 
taken the antisymmetric tensor fields and fermions to vanish in the background, 
the equations for these fields are particularly simple. Consider the antisymmetric 
tensor, B,,,. On a complex manifold, as we explained earlier, there are h'! 1,1) 
forms, ae and h! (2, 1)-forms annihilated by the operators 0 and 0. Since the 
corresponding three-index field strengths, H = dB, vanish, there is no energy cost 
to giving a constant expectation value to the associated four-dimensional fields; 
they correspond to massless scalars in four dimensions. The fields connected to the 
(1, 1)-forms, b;7, are easy to describe. In addition to the antisymmetric tensor, there 
is also a massless perturbation of the metric: 


igi(®, ¥) = O@)bii(y). (26.42) 


Here x refers to the ordinary four-dimensional Minkowski coordinates, and y refers 
to the compactified coordinates. Similarly, in the IIA theory, one can find a massless 
gauge field, rounding out the bosonic components of the vector multiplet. This 
comes from the three-index Ramond field, 


Cyii(%, Y) = Aw )bi). (26.43) 


We will leave for the reader the problem of working out the structure of the hyper- 
multiplets in terms of the (2, 1)-forms, and also determining the pairings in the IIB 
case. 

One (1, 1)-form which is always present is the Kahler form: 


bf 185, Di = 18s (26.44) 

This satisfies 
ab = 0b = 0 (26.45) 
because g;; = 0;0;K. The real scalar which sits in the multiplet with b* is just 


the metric itself. The corresponding massless field is the radius of the compact 
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space: 
g(x", 2’) = R'(x")g, 7(z); Bul", 2!) = bx") d,q(Z). (26.46) 


That the field is massless is no surprise; the condition R;; = 0 is not changed under 
an overall rescaling of the metric so the vev is undetermined. 


26.4 World sheet description of Calabi-Yau compactification 


So far, we have described compactification of string theory in terms of ten- 
dimensional space-time. This analysis makes sense if the radius of the compactified 
space is large compared to the string length, £,. We can also formulate these ques- 
tions in world sheet terms. This provides a complementary way to understand many 
features of the compactified theory. This is useful for at least two reasons. First, 
it provides tools to ask what happens when the compactification radius is of order 
the string scale or smaller. Second, there are some features of the spectrum and 
interactions which are more readily accessible in this framework. 

In the Type II theory, the non-linear sigma model which describes compactifi- 
cation on a Calabi—Yau space has some striking features. First, in the absence of 
background antisymmetric tensor fields, it is left-right symmetric. Second, there 
are two left-moving and two right-moving supersymmetries on the world sheet, as 
opposed to the one left-moving and one right-moving supersymmetry of a general 
configuration. This can be usefully understood in a number of ways. In the light 
cone gauge, one can work with the covariantly constant spinor 7 and its conjugate, 
n, to construct two left-moving and two right-moving supersymmetry generators, 
both in the sense of the world sheet and in space-time. We have already seen this 
in the case of orbifold constructions. There, in the light cone gauge, we have eight 
left-moving and eight right-moving supersymmetry generators, before the orbifold 
projection. We can organize these in terms of their transformation properties under 
the SU(3) x U(1) holonomy group. For both the left and right movers, there are 
triplets, Q;, anti-triplets, Q;, and singlets, Qy and Qo. The triplets and anti-triplets 
are charged under the U(1); the singlets are not. The orbifold projection eliminates 
the triplets. The two singlets survive. 

In a purely world sheet description, non-linear sigma models described by a 
Kahler metric automatically have two left-moving and two right-moving super- 
symmetries. To describe these, we can introduce a superspace with four Grassmann 
coordinates, two left movers and two right movers: @ ik and 64. This superspace can 
be thought of as the truncation of N = 1 supersymmetry in four dimensions. We 
can define, as in four dimensions, operators D, and d,, and left- and right-moving 
chiral fields annihilated by the ds. Correspondingly, we can define chiral left- and 
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right-moving fields: 
X'(z, 0) = x'(z) + Of wi) (z) + auxiliary field (26.47) 


and similarly for X ‘ In terms of these fields, we can write the action of the con- 
formal field theory as: 


/ do / d’0,d°0_K(X, X). (26.48) 


Integrating over the 0s, the bosonic terms are just f d0g;70yx! d%x!, with g,; the 
Kahler metric. 

The superconformal algebra, in these backgrounds, is enlarged to what is referred 
to as the N = 2 superconformal algebra (one such algebra for the left movers, one 
for the right movers). In addition to the stress tensor and the two supercurrents, this 
algebra contains a U(1) current. The supersymmetry generators can be constructed 
by the Noether procedure. They can also be guessed by taking the generators in a 
flat background, and making the expressions covariant: 


Gt=¢;Dx'y' Gy =8;DX'v'. (26.49) 


These have opposite charge under the U(1) current (an R current) constructed from 
the fermions: 


i@=V@w®@ (26.50) 
and a similar current for the left movers. The full algebra is: 


+ 3 ad. Today 
T(z)G~(0) © 7? a ow 


1 1 
T(z) j (0) * iO) - ee 


JZ)G*(0) © 426%) (26.51) 


These equations say that G has dimension 3/2, while j has dimension 1, and G* 
have U(1) charges plus and minus one. The central charge appears in the relations: 


_ 2c i. 2 | oe 
G*(z)G" (0) © | + SJ(0) + [T(0) + —9j(0) 
3z Zz b z 
G*(z)G*(0) ~ 0 
Cc 
>: 


JQIO*~ o> (26.52) 

3z 

The non-linear sigma models appropriate to heterotic compactifications on 
Calabi—Yau spaces have a number of interesting features. We will see that, for a par- 
ticular choice of gauge fields, the world sheet theory which describes the heterotic 
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compactification is identical to that of the Type II theory. Thus, again, they have 
two left-moving and two right-moving supersymmetries ((2, 2) supersymmetry). 
The fact that the world sheet theories of the two different string theories are the 
same allows us to argue, as we will below, that Calabi-Yau spaces are solutions of 
the full, non-perturbative string equations of motion. But this observation also tells 
us about interesting features of the spectrum. 

To understand the spectrum, it is helpful to ask, first, what is a vertex operator 
from the perspective of the two-dimensional conformal field theory? The answer 
is that a vertex operator is a marginal deformation of the theory, a perturbation 
of dimension 2 ((1, 1), in terms of the left- and right-moving Virasoro algebras). 
The standard way to compute the dimensions of operators is to treat them as per- 
turbations, and calculate, for example, the beta function of the perturbation. For 
marginal operators, the beta function vanishes to first order. Moduli correspond to 
“exactly marginal deformations” of the theory. For these, the beta functions vanish 
to all orders in the perturbation (and non-perturbatively), corresponding to the fact 
that the theory, even for a finite perturbation, is conformal. 

The existence of moduli means that there is a multiparameter set of conformal 
field theories. Varying the action with respect to the parameters yields operators 
which are exactly marginal. In this way, we have the two-dimensional version of 
the correspondence between moduli and massless fields. 

An example of a modulus is the radius of the complex space. The lowest-order 
equation for the metric is invariant under an overall scaling of lengths. But this 
is not obviously true of the higher-order corrections. For Type II theories, the 
space-time supersymmetry guarantees that there is no potential for the moduli, 
so the sigma model is a good conformal field theory, suitable for heterotic string 
compactification. On the heterotic side, we can also give a more direct world- 
sheet argument. Here R~° is the coupling constant of the o model. In other words, 
writing the metric as R? times a reference metric of order the string scale, R* 
scales out of the Lagrangian. We know that the lowest-order beta function equation 
is the same as the field theory equation. It is trivially independent of R7, since 
it is a one-loop effect. For higher orders, there is a non-renormalization theorem. 
This follows from a combined world sheet, space-time argument. The superpartner 
of fluctuations in the radius is the fluctuation of the antisymmetric tensor field, 
b;; = ig;z. The associated vertex operator (term in the action) is a total derivative 
on the world sheet at zero momentum. It is perhaps easiest to see this by writing 
the vertex operator at zero momentum in the form: 


Vp = buneP dg X™ agX™ 
= dy dy Ke Oy X™ OgXN 
= dyfe%dgX™ Jy KI. (26.53) 
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So b decouples at zero momentum. Because b is in a supermultiplet with R?, this 
means that the superpotential, which is a holomorphic function of the superfields, 
is independent of R?. 

Actually, this statement is not precisely correct, because K is not single-valued. 
In perturbation theory, it is true, since one is not sensitive to the global structure of 
the manifold (in perturbation theory, all fluctuations are small). Non-perturbatively, 
one can encounter instantons in the world sheet theory. A more detailed analysis 
is required to determine whether there are corrections to the superpotential. In 
left-right symmetric compactifications of the heterotic string, those with two left- 
moving and two right-moving supersymmetries ((2, 2) models), a study of fermion 
zero modes in the presence of the instanton shows that no superpotential for the 
moduli is generated; this is consistent with the expectations from the Type II theory. 
For compactifications with two right-moving but no left-moving supersymmetries 
((2, 0) models), corrections can be generated, though in some cases intricate cancel- 
lations still prevent the appearance of a potential for the moduli. These two classes 
of models are phenomenologically quite distinct, as we will see shortly. 


26.5 An example: the quintic in CP* 


It is helpful to have a concrete example of a Kahler manifold with c; = 0, on 
which we know one can construct a metric of SU(3) holonomy. We have previ- 
ously encountered the complex projective spaces in N dimensions, CP’. These are 
defined as the spaces with N + 1 complex coordinates, Z,, with the identification 
Za — AZq, for any complex number A. We have written down a Kahler potential 
on this space: 


N 
K =I1n ( +0 22.) (26.54) 


a=1 


Any complex submanifold of a Kahler manifold is also a Kahler manifold; one 
can simply take the Kahler potential to be the Kahler potential of the full manifold 
evaluated on the submanifold. To obtain a manifold with three complex dimensions, 
we can start with CP*, and write an equation for the vanishing of a polynomial, 
P(Z). The polynomial should be homogeneous, in order that it have a sensible 
action in CP’. It turns out that it should satisfy other conditions. Its gradient should 
vanish, at most, at the origin (which is not a point in CP’). In order that the first 
Chern class vanish, it should be quintic. We will give an argument for this shortly. 
The simplest (most symmetric) possibility is: 


P=Z3+234+2734+7Z734+Z2=0, (26.55) 
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but there are obviously many more. We can deform this polynomial by adding other 
quintic polynomials. These correspond to varying the complex structure. Since each 
deformation produces another solution of the string equations, each deformation 
corresponds to a modulus, the “complex structure moduli.” Associated with each of 
these deformations is a form of type (2, 1), which we will not attempt to construct 
here. 

Before listing the deformations, note that not every deformation corresponds to 
a change of the physical situation — and thus to a massless particle. Holomorphic 
changes of the coordinates which are non-singular and invertible do not change the 
complex structure. The transformation 


Zi > Z+€4Z; (26.56) 


is well-defined in CP*. As a consequence, deformations such as ZiZ3 are not 
physical. So we can list the possible deformations: 


LOTR ete Dlg ie DeLee wes’ Dlg hala nt Dill tals: (051) 


All together there are 101 possible deformations of the polynomial, corresponding 
to hz; = 101. In this example, there is only one Kahler modulus, the overall radius 
of the compact space. 

We can understand heuristically why the first Chern class vanishes in a way which 
we will help us understand other features of these manifolds. A characteristic feature 
of the Calabi-Yau spaces is the existence of a covariantly constant 3-form, @j;x. 
The existence of this form follows from the existence of the covariantly constant 
spinor, 77: 


ijk = NV jRN- (26.58) 


Working in terms of the creation—annihilation operator basis for the I's, one sees 
that w is holomorphic. The I’;s can be defined so that the Ij matrices annihilate 
n. Then, because of the complete antisymmetrization, only components of w with 
i, j, k indices are non-vanishing. In the space defined by the vanishing of a quintic 
polynomial in CP*, we can show that there exists a holomorphic three-form which 


is everywhere non-vanishing. Calling x! = Z;/Zs,i =1,..., 4: 
aP\"' 
w = dx' A dx? Adx* (3) (26.59) 
Ox 


One can show that this expression does not depend on singling out a particular 
coordinate, and that it is not singular at the points where the derivative vanishes, 
provided that the polynomial P is quintic and that the gradient of P vanishes only 
at the origin. The existence of such a form can be shown to be equivalent to the 
vanishing of the first Chern class. 
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26.6 Calabi-Yau compactification of the heterotic string at weak coupling 


Much effort has been devoted to the study of compactifications of the weakly 
coupled heterotic string on Calabi-Yau spaces. These theories have many of the 
features of the Standard Model. They also allow one to consider many of the ques- 
tions of beyond the Standard Model physics. Before beginning an analysis of these 
models, it is worth listing some of the points that we can address in this framework. 


(1) Low-energy supersymmetry: solutions of the classical equations of the heterotic string 
theory on Calabi—Yau spaces exist. They have N = 1 supersymmetry. Supersymmetry, 
as in field theory, is unbroken to all orders of perturbation theory, but may be broken 
non-perturbatively. 

(2) Low-energy gauge groups: the simplest constructions have gauge group Ex x E¢, bro- 
ken perhaps by Wilson lines, which preserve the rank of the gauge group. But many 
models have a moduli space in which the gauge group is broken to precisely that of the 
Standard Model. 

(3) Generations: the number of generations is typically determined in terms of topological 
features of the underlying manifold. 

(4) Massless particles, not protected by symmetries: various massless states arise, which 
are not protected by chiral symmetries. This is precisely what we want in order to 
understand the presence of light Higgs fields in supersymmetric theories. We know that 
if such fields are present in the low-energy field theory, they are protected from gaining 
large masses by non-renormalization theorems. In field theory, the vanishing of such 
mass terms appears mysterious; in these string constructions, it is automatic. Such states 
could play the role of Higgs fields in supersymmetric models. In other words, the wu 
problem of ordinary supersymmetric field theories is readily solved in this framework. 

(5) Unification of couplings: the string theories we are studying are not grand unified 
theories in the conventional sense. There is no energy scale at which these compactifi- 
cations appear as four-dimensional theories with a single unbroken gauge group. Yet, 
generically, couplings are unified. These last two points, which we will see are easy 
to understand in terms of the microscopic structure of string theory, are quite surpris- 
ing from a low-energy point of view. They have sometimes been referred to as “string 
miracles.” 

(6) Continuous and discrete symmetries: it is easy to prove that, for these compactifica- 
tions (and for weak-coupling heterotic models in general) there are no continuous global 
symmetries; all continuous symmetries must be gauge symmetries. Discrete symme- 
tries, on the other hand, proliferate, and might play the role of R-parity or lead to other 
interesting phenomena. These discrete symmetries are typically gauge symmetries, in 
the sense that they are residual symmetries left over after the breaking of continuous 
gauge symmetries. 


We will also see that there are a number of problems with these models, which 
illustrate some of the basic difficulties in developing a string phenomenology. 
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(1) There are too many of them. While there are models with three generations (many), 
there are models with hundreds of generations, with non-standard gauge groups and the 
like. 

(2) The problem of moduli: non-perturbatively, moduli can acquire potentials, but they 
typically vanish in various asymptotic regimes. Simple general arguments indicate that 
stable, supersymmetry-breaking minima, if they exist, must be in regions which are in- 
herently strongly coupled, in the sense that no weak coupling approximation is available. 

(3) The problem of the cosmological constant: this is closely related to the previous one. In 
many instances moduli potentials can be calculated. For any given value of the moduli, 
the size of these potentials is scaled as one would expect by the scale of supersymmetry 
breaking. As a result, even if strongly coupled, stable minima exist, it is not clear why 
the cosmological constant should be small at these points. 


We will not offer a solution to these problems in this chapter, but will explore at 
least one proposed answer known as the “landscape” in the concluding chapter. 


26.6.1 Features of Calabi-Yau compactifications of the heterotic string 


In the previous section, we asserted that, in suitable backgrounds, the world sheet 
conformal field theory which describes the heterotic string is the same as that which 
describes the Type II theory. Here, we describe compactifications of the heterotic 
string theory in more detail. 

To construct solutions, we still look for solutions which preserve a space-time 
supersymmetry. Again, we require the supersymmetry variation of the gravitino to 
vanish, giving D,,n = 0, so once more we need a covariantly constant spinor. There 
is now an equation for the variation of the ten-dimensional gaugino, as well: 


dA « TY Fin. (26.60) 


One strategy, then, to find solutions which preserve N = | supersymmetry is to 
require that F;;I''/ is an SU(3) matrix. There is a simple Ansatz which achieves 
this. Both Eg and O(32) have SU(3) subgroups: 


SU(3) x Eg x Eg C Eg x Eg SU(3) x O(26) C O(32). (26.61) 


On the Calabi—Yau space, the spin connection is an SU(3) valued field, so take 
the gauge field to be a field in one of these SU(3) subgroups. Then for gauge 
generators not in SU(3), Eq. (26.60) is automatically satisfied. For those in SU (3), 
the condition is mathematically identical to that for the gravitinos, and is again 
satisfied. 

This Ansatz satisfies another condition. We put the antisymmetric tensor field B 
to zero, but, because of the Chern—Simons terms, this does not by itself guarantee 
that the field strength H is zero. But with this Ansatz, the Chern—Simons terms for 
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the gauge and gravitational fields are identical. As a quick check, note that 
dH=TrRAR-TrFAF, (26.62) 


and these terms clearly cancel. 

This establishes that this is a solution of the equations of motion to lowest order 
in the a’ expansion. But there is another way to see this, which will allow us to 
establish, as we did for the Type II theory, that this is an exact solution, perturbatively 
and non-perturbatively. If we write the non-linear sigma model which describes the 
heterotic string in this background, it is identical to that for the Type II theory. To 
see this, as in the orbifold case, we divide the left-moving gauge fermions into 
three sets. First, there are the fermions in the “other” Es, which are not affected 
by the background gauge field and remain free, 14, A = 1,... 16. In the first Es, 
10 fermions, 4°,a = 1,..., 10 (transforming as a vector in the O(10) subgroup of 
E¢) are also free. The remaining six interacting fermions can be grouped, like the 
left-moving coordinates, into three complex fermions, 2! and 2!. These fermions 
interact in precisely the same way as the left-moving fermions in the Type II theory. 
This can be seen by writing the action of the Type II fermions in terms of the vierbein 
and spin structure, rather than the metric and the Christoffel connection. 

We see from this that the moduli of the Type II theory are also moduli of the 
heterotic theory. Actually, we knew this had to be, since we know that each of these 
conformal field theories, on the Type II side, is a good conformal field theory for the 
heterotic theory. But we can also see this pairing more directly in the language of 
vertex operators. Here it is somewhat more convenient to work in the RNS picture. 
The vertex operators correspond to small deformations of the background in the 
directions of the moduli. In the Type II theory, they are built from right-moving 
fields, 0X‘ and yi , and left-moving fields, ax, a . Inthe heterotic case we can trade 
w' with A’. Since the action for the 4’s is the same as for the y's, the dimensions 
of the vertex operators are exactly the same. This does not preclude the existence 
of additional moduli on the heterotic side, and we will see that there are typically 
additional moduli in these compactifications. 

While all moduli of the Type II theory are moduli of the heterotic theory, not 
all heterotic moduli correspond to states of the Type II theory. Vertex operators for 
moduli which preserve only two right-moving supersymmetries ((2, 0)) are not suit- 
able vertex operators for the Type II theory. The moduli we are considering here are 
distinguished because they preserve the two left-moving world sheet supersymme- 
tries, and we will refer to these as Type II moduli. Perhaps more interesting, though, 
than the pairing of moduli is is a pairing of the Type II moduli with matter fields. 
The moduli associated with (2, 1)-forms are paired with 27s of E.; (1, 1) moduli 
with 27s. This is most readily seen in the language of vertex operators, using the 
world sheet superconformal symmetry. The vertex operators for the Type II theory 
are the highest components of the corresponding superconformal multiplets with 
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respect to both left- and right-moving supersymmetries. In superspace, they are the 
Be 6? components of operators of the form: 


FX x): (26.63) 


The 6,62 component has dimension (1/2, 1). We can form an operator of dimension 
(1, 1) by multiplying by 4“, one of the free fermions. This operator is not highest 
weight with respect to the left-moving N = 2 algebra, but this is not a problem; this 
symmetry is not a gauge symmetry on the world sheet, but simply an “accident” of 
our choice of background field. It is highest weight with respect to the left-moving 
Virasoro algebra, which is all that matters. 

We already observed this pairing in the Z3 orbifold model, which is a special 
case of the Calabi—-Yau construction. In the untwisted sector, the vertex operators 
for the moduli took the form, on the left: 


ax", (26.64) 
while for the 27s they took the form: 
Par ha (26.65) 


The supersymmetry transformation of the latter operator changes A! to 0X’. 

The distinction between 27s and 27s is readily understood. In the Type II case, 
we can distinguish two types of moduli, depending on their charges under the U(1) 
within the left-moving N = 2 algebra. In the orbifold context some vertex operators 
involve 0X‘, some 3X’. In the heterotic case, the world sheet U(1) symmetry 
corresponds to the U(1) subgroup of FE in the decomposition O(10) x U(1) C Eg. 
This U(1) charge is precisely what distinguishes the 10s, for example, in the 27 and 
27. In the Type II case, this distinction corresponds to the distinction between (2, 1) 
and (1, 1) moduli, so we obtain precisely the pairing we described above (note that 
what one calls a 27 and a 27 is a matter of convention; if one adopts the opposite 
convention, the identification is reversed). 

This result holds everywhere in the moduli space; since the number of moduli 
of each type does not change as one moves in the moduli space, the number of 27s 
and 27s does not change. This is a surprising result. One might have thought that, 
in a complicated construction such as this, 27 and 27s would, whenever possible, 
pair to gain mass. But this is not the case. This is precisely the sort of phenomena 
one needs to understand light Higgs particles in supersymmetric theories. We will 
see shortly how this works in a more detailed model. 


26.6.2 Gauge groups: symmetry breaking 


So far, the heterotic models we are considering have group Eg x E¢. If we are to 
describe the Standard Model, we need to be able to break this symmetry. We have 
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seen in the case of toroidal compactifications that gauge symmetries can be broken 
by expectation values of gauge fields with indices in compactified dimensions. 
Stated in a more gauge-invariant fashion, these are non-trivial expectation values 
for Wilson lines. In the Calabi—Yau case, the same is possible. 

Let’s consider a specific example: the quintic in CP*, with vanishing of the 
polynomial: 


Z3+Z3+Z3+Z34+Z2=0. (26.66) 


The corresponding Calabi-Yau manifold, we saw, has 101 27s and one 27. This 
polynomial has a variety of symmetries. As for the torus, we can use these to project 
out states and simplify the spectrum. Consider, for example, the symmetry: 


Qui 


Zi >aiZ; a=es. (26.67) 


This is a symmetry of the polynomial. It is somewhat different than the orbifold 
symmetries we have discussed, since, as the reader can check it acts without fixed 
points. Mathematicians call such a symmetry “freely acting.” For physics, it means 
that if we mod out the Calabi-Yau by this symmetry, while it is still necessary to 
include twisted sectors, the twisted strings have mass of order R, the Calabi-Yau 
radius, and there are no light states in these sectors if R is large. 

We can readily classify the states invariant under this symmetry. Among the 
moduli, there are 21 hz, fields, associated with polynomials such as Z *Z3 Z4 and 
Z,Z2Z3Z4Z5. The Kahler modulus (overall radius) is also invariant under this 
transformation, and so survives the projection. The corresponding Euler number is 
40, 1/5 the Euler number of the covering space. There are, as well, 21 27s of E¢ 
and one 27. Further symmetries can be used to reduce the number of generations 
to as few as four. 

But what interests us here is obtaining smaller gauge groups. We can define 
the Z; to include a transformation in E,¢. This is equivalent to the presence of a 
Wilson line on the manifold. An interesting way to do this is to consider a somewhat 
different decomposition of E than we have considered up to now: 


SU(3) x SU(3) x SU(3) C Es. (26.68) 


An example of a Wilson line in this product of SU(3)s is: 
1 0 0 a 0 O a 0 0O 

U={]0 1 07x{O0 a O}]x{Oa Oo (26.69) 

0 0 1 0 0 o& 0 0 a 


Ww 


This breaks E¢ to SU(3) x SU(2) x SU(2) x U(1)’. 
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26.6.3 Massless Higgs fields, or the 1 problem 


When we mod out so as to reduce the gauge symmetry, we also alter the spectrum. 
We have seen that we greatly reduce the number of moduli and the number of 
generations. The presence of the Wilson lines also disrupts the left-right symmetry 
of the model. As a result, the pairing of moduli and matter fields is no longer quite 
so simple. 

In the presence of the Wilson line, one still obtains 20 complete E¢ generations. 
If one thinks, loosely, of some of the massless fields “gaining” mass, elements 
of the 27 and 27s must pair up to gain mass. More precisely, in this modding 
out procedure, states disappear, but they must disappear in pairs. But one also 
obtains some incomplete multiplets, where paired states do not disappear. Con- 
sider the 27. This is invariant under the original Z5s, so any state which survives 
must be invariant under the Wilson line. Using the decomposition of the 27 under 
SU(3): 


27 = (3, 1,3) + 3, 3, 1) +, 3, 3). (26.70) 


So we obtain Zs singlets from only the third multiplet. These form a (1, 2, 2) under 
SU(3) x SU(2) x SU(2), as well as a singlet. There is a corresponding pair of 
states from the 27s. This is the sort of multiplet we would like to understand the 
presence of light Higgs particles in supersymmetric models: massless states, at tree 
level, which arise, from a low-energy point of view, more or less by accident. 


26.6.4 Continuous global symmetries 


In the heterotic string theory, there are no continuous global symmetries. We won’t 
give the formal proof here, but the basic argument is not hard to understand. If 
there is a global symmetry, it should be a symmetry of the world sheet theory. 
In this way, we are guaranteed that vertex operators can be chosen to have well 
defined transformation properties, and so the S-matrix will transform properly. 
The global symmetry will be associated with a world sheet current. This current 
can be decomposed into left- and right-moving pieces. But from any left-moving 
current, we can build a gauge boson vertex operator, so the symmetry is necessarily 
a gauge symmetry. Right-moving currents will not commute with the world sheet 
supersymmetry generators, and will not have well-defined action on states (in the 
BRST language, they do not commute with the BRST operator). So they are not 
symmetries in space-time. 

This argument also indicates that there are no global symmetries in the Type II 
theories. This is in accord with our expectation that global symmetries are unlikely 
to arise in a theory of quantum gravity. 
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26.6.5 Discrete symmetries 


When we studied orbifold models, we found discrete symmetries existed in a subset 
of vacua on the full moduli space. This is also the case for the Calabi-Yau manifold 
constructed from the vanishing of a quintic polynomial in CP*. Such symmetries 
turn out to be quite common. 

The quintic polynomial, P = )~ Z?, exhibits a set of Zs symmetries: 


2ni 


Zi > aZ; a=e>s. (26.71) 


An overall phase rotation of all of the Z;s has no effect in CP*, so the symmetry here 
is Ze There is also a permutation symmetry, Ss. This symmetry group is a subgroup 
of the O(6) symmetry which would act on six non-compact, flat dimensions. We can 
thus think of these symmetries as discrete gauge transformations. So their existence 
in a theory of gravity is not surprising. 

We would like to know if these symmetries are R-symmetries or not. We can 
address this by asking their effect on the covariantly constant spinor, n. This is 
more challenging to do than in the orbifold context, since we do not have quite such 
explicit expressions. It is simplest to look at the covariantly constant 3-form. We 
already gave a construction: 


aP\"! 
w =dx! Adx* Adx* (s) (26.72) 


with x! = Z;/Zs. This construction treats the coordinates asymmetrically, but, as 
we explained, w is symmetric among the coordinates. Note that w transforms es- 
sentially like 7, i.e. like 67. So symmetries under which w transforms non-trivially 
are R-symmetries, and W transforms like w. 

Consider, first, the Z5 transformations of the separate Z;s. We can read off 
immediately how m transforms under transformations of the first three; the other 
two follow by symmetry. So 


o> ar‘, (26.73) 


Similarly, under those $5 transformation which permute Z,, Z2, Z3, we can see 
how q transforms. If the permutation is odd, w changes sign. So again, the general 
Ss transformation is an R-symmetry. 

Turning on the various complex structure moduli typically breaks some or all 
of this symmetry. For example, if we turn on the modulus associated with the 
polynomial 


2122232425 (26.74) 
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we break the Z? symmetry down to a subgroup satisfying )°k; = 0 mod 5. This 
group is Z3, but it is anon-R-symmettry, in light of the transformation law of w. An 
expectation value for this field clearly preserves the permutation symmetry. 

Similarly, turning on, say, aZ >Zo + bZ nee breaks the symmetries acting on Z 
and Z2, as well as some of the permutation symmetry. Turning on enough fields 
breaks all of the symmetry. 

One might ask why one should be interested in points or surfaces in the moduli 
space which preserve a discrete symmetry, when in the bulk of the space there is no 
symmetry. This question is closely related to the question: what sorts of dynamics 
might fix the moduli? This is a subject which we will deal with extensively later, but 
for which we will provide no definitive resolution. But even without understanding 
this dynamics, there is a simple reason to suspect that points in the moduli space 
with symmetries might be singled out by dynamics. Imagine that we somehow 
manage to compute an effective potential for the moduli, arising, perhaps, due to 
some non-perturbative string effects. Symmetry points are necessarily stationary 
points of this effective potential. There is, of course, no guarantee that they are 
minima of the potential, but they are certainly of interest as candidates for string 
ground states. 

There are, as we have seen, certain facts of nature which suggest that discrete 
symmetries might play some role in extensions of the Standard Model, including 
proton decay and dark matter. 


26.6.6 Further symmetry breaking: the Standard Model gauge group 


The Wilson line mechanism, as we have described it, provides a path to reduce 
the gauge symmetry from E¢ x Es, but leaves the rank untouched.! We can hope 
to reduce the gauge symmetry further by giving expectation values to some of the 
matter fields. Ideally, these expectation values would be large. The presence of 
other gauge groups (as well as unwanted matter multiplets) can spoil the prediction 
of coupling unification, and lead to severe difficulties with proton decay and other 
rare processes. We are led, then, to ask if we can consider more general states, in 
which the spin connection is not equal to the gauge connection, and the rank is 
reduced. 

This is acomplex subject, which has only been partially explored. At lowest order 
in the a’ expansion, there are such flat directions. They are not left-right symmetric, 
and while, in order that they exhibit space-time supersymmetry, they have two right- 
moving supersymmetries, they have only one left-moving supersymmetry. So they 
are not suitable backgrounds for Type II theories, and one cannot argue as easily as 


' Non-Abelian discrete symmetries offer possibilities for reducing the rank, but we will not explore these here. 
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for the standard embedding that these (0, 2) configurations are solutions of exact 
classical string equations. They are still subject to perturbative non-renormalization 
theorems in a’. But detailed study of instanton amplitudes is required to determine 
if these flat directions are lifted non-perturbatively, i.e. by corrections of the form 
ene, 

There are, however, a class of vacua with Standard Model gauge group which 
can be found by symmetry arguments, much as we found additional flat directions 
in the Z3 orbifold model. Consider, again, the quintic in CP°, with the symmetric 
polynomial. We can find flat directions of the D terms by taking 27 = 27. More 
precisely, starting with the EL decomposition into O(10) representations, 


27 = 10' + 1-5 + 16_1;2, (26.75) 


we can give expectation values to the singlets in the 27 and one of the 27s. These are 
also flat directions of the F terms. For example, consider the 27 corresponding to the 
polynomial Z;ZZ3Z4Zs5. The product, 27 27 is invariant under all of the discrete 
R-symmetries; no terms of the form (27 27)” can appear in the superpotential. So 
this direction is exactly flat (terms of the form 27°, 27° cannot lift these directions 
either). In combination with Wilson lines, these flat directions readily break to the 
SU(3) x SU(2) x U(1) group of the Standard Model. 


26.6.7 Gauge coupling unification 


One of the striking successes of low-energy supersymmetry is its prediction of 
unification. Within the context of grand unification — where the gauge group of 
the Standard Model is unified in a simple group at a scale Mgur — the fact that 
the couplings unify is readily understood. In the context of the compactifications 
considered here, it is not immediately obvious why this should be the case. In the 
case of symmetry breaking by Wilson lines, for example, the compactification scale 
and the scale of the symmetry breaking are of the same order. So there is no energy 
scale where one has a unified, four-dimensional effective theory. 

In the weakly coupled heterotic string, however, the couplings do unify under 
rather broad conditions. In the case of Wilson line breaking this can be understood 
immediately in field-theoretic terms. The effect of the Wilson line is to eliminate 
states from the EF unified theory, but at tree level no couplings are altered. So the 
couplings of all groups emerging from F¢ are the same. Perhaps more surprising is 
the fact that the Eg and Ex couplings are the same. This can be seen by considering 
the vertex operators for the gauge bosons in each group. In both cases, the vertex 
operators are constructed in terms of free two-dimensional fields, which obey the 
same algebra (in the unbroken subgroup) as in the flat space theory. So, for example, 
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the operator product expansions of these gauge boson vertex operators are unaltered. 
There are constructions where unification does not hold. These involve replacing the 
two-dimensional fermions with a current algebra with a different central extension. 

In the (2, 1) flat directions considered above, we can give an argument based 
on the low-energy field theory that the couplings remain unified as one moves out 
along the flat direction. A change in the coupling requires that there be a coupling 
of this modulus to the gauge fields. But at the classical level, we know that there 
are no such couplings, because any such coupling would violate the axion shift 
symmetry. This symmetry is unaffected by the expectation value of these moduli. 

When we come to consider strongly coupled strings, the problem of coupling 
unification will be more complicated. It will be less clear in what sense unification 
is generic. Whether this is a problem for the theory, or a clue, is a question for the 
student to ponder. 


26.6.8 Calculating the parameters of the low-energy Lagrangian 


As we have explained, string theory is a theory without fundamental dimensionless 
parameters. On the other hand, the structure of the low-energy theory, we now see, 
depends on discrete choices: which Calabi—Yau, orbifold, etc., in how many dimen- 
sions, with how much supersymmetry, with which Wilson lines, and continuous 
dynamical quantities, the moduli. For any given choice, at least classically, it should 
be a straightforward problem to calculate the parameters of the low-energy theory. 

It is easy to calculate the four-dimensional gauge couplings in terms of the ten- 
dimensional dilaton and the radius. We have already seen how this works for simple 
compactifications, and this carries over directly to the Calabi-Yau case, since the 
vertex operators for the gauge fields are constructed in terms of two-dimensional 
fields, as in the orbifold or toroidal case. 

The cosmological constant is another interesting quantity in the low-energy 
theory. At the classical level in the Calabi-Yau compactifications, it vanishes. This 
can be understood in a variety of ways. First, if we examine the solution of the ten- 
dimensional equations of motion, we see that since the Ricci tensor vanishes, there 
is no cosmological term. Second, in the two-dimensional conformal field theory, 
the cosmological constant would give rise to a tadpole for the dilaton, but this 
is forbidden by conformal invariance. Ultimately, the absence of a cosmological 
constant is inherent in the form of the solution: we assumed that four dimensions 
are flat. We will see later that this is not necessary: string theory admits AdS spaces, 
as well as Minkowski spaces, as classical solutions. 

From the perspective of trying to understand the Standard Model, a particularly 
important set of parameters are the Yukawa couplings. These can certainly be 
computed in the string theory. In principle, we should construct the vertex operators 
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for the appropriate matter fields, and then construct the required OPE coefficients 
or suitable scattering matrices. In practice, this can often be short-circuited. In the 
orbifold models, for example, in the untwisted sectors we can read off the Yukawa 
couplings by dimensional reduction of the ten-dimensional Lagrangian. The scalar 
fields are components of the original ten-dimensional gauge fields, A;. Similarly the 
fermions are components of the ten-dimensional gauginos. In the orbifold theory, 
alternatively, it is not difficult to construct the vertex operators and to compute the 
required OPE coefficients. 

In the Calabi—Yau case, we have seen that, in the a’ expansion, the superpotential 
is independent of R. So one can work at very large radius, and pick off the leading 
contribution. To actually do the computation, one can construct the zero modes of 
the scalar and spinor fields, and substitute in the Lagrangian. A priori, one might 
expect that this would be quite difficult, given that one does not have an explicit 
formula for the metric. But it turns out that the Yukawa couplings have a topological 
significance, and their values can be inferred by general reasoning. We will not have 
use for explicit formulas here, but it is important to be aware of their existence. 


26.6.9 Other perturbative heterotic string constructions 


The quintic is just one of a large class of Calabi-Yau models which can be con- 
structed. The exact number is not actually known. It is not even known, with cer- 
tainty, whether the number of Calabi—Yau vacua is finite or infinite. 

So while we will not assess here the size of this space, there is clearly a large 
class of string solutions with gauge group identical to that of the Standard Model. 
These theories have varying numbers of generations, including both orbifold (or 
free fermion) models and Calabi—Yau constructions with three. There are many 
models with groups, numbers of generations, and other features radically different 
than those of the Standard Model. Still, it is remarkable how easily we have ob- 
tained models which accord with some of our speculations for Physics Beyond the 
Standard Model. We have found low-energy supersymmetry, coupling unification, 
light Higgs particles, discrete symmetries which can potentially suppress proton 
decay and give rise to a stable dark matter candidate, all in a framework where we 
can imagine that real calculations are possible. 

In subsequent chapters, we will turn to the problems of actually turning these 
observations and discoveries into a real theory which we can confront with 
experiment. 


Suggested reading 


Volume 2 of Green et al. (1987) provides a comprehensive introduction to Calabi- 
Yau compactification, and I have borrowed heavily from their presentation. Weakly 
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coupled string models with three generations have been constructed in the context 
of Calabi-Yau compactification; their phenomenology is considered by Greene 
et al. (1987). Models based on free fermions have been constructed by Faraggi 
(1999). We will encounter non-perturbative constructions in Chapter 28. At special 
points in their moduli spaces, some Calabi—Yau spaces can be described in terms 
of solvable conformal field theories. This program was initiated by Gepner (1987), 
and is described at some length by Polchinski (1998). A very accessible description, 
including computations of physically interesting couplings, appears in Distler and 
Greene (1988). 


Exercises 


(1) Write the field strength of electrodynamics as a two-form, and express its gauge in- 
variance in the language of forms. Verify that dF = 0 is the Bianchi identity (the 
homogeneous Maxwell equations). 

(2) Show that for a Kahler manifold, the non-vanishing components of the affine connection 
(Christoffel symbols) are given by Eq. (26.36). Then show that the non-zero components 
of the Riemann tensor are given by Eq. (26.37) and verify Eq. (26.38). Derive Eq. (26.40) 
by noting that 


2. = a5 Indet g. (26.76) 


Show that our result for the two-dimensional curvature of a Riemann surface is a special 
case of this. 

(3) For a flat, two-dimensional torus, introduce complex coordinates and verify that the 
bosonic and fermionic terms are just those of the free string action. You can take 
K = X'X for this case. 

(4) Write out the action of the heterotic string propagating in the Calabi-Yau background 
with spin connection equal to the gauge connection in some detail. Determine the form 
of the vertex operators for the 27 and 27 fields, in the RNS formulation (you can limit 
yourself to the NS—NS sector). 

(5) Exhibit a combination of Wilson lines and SU (5) singlet expectation values which break 
the gauge group to that of the Standard Model in the case of the quintic in CP4. 
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Dynamics of string theory at weak coupling 


In previous chapters, we have seen that string theory at the classical level shows 
promise of describing the Standard Model, and can realize at least one scenario 
for physics beyond: low-energy supersymmetry. But there are many puzzles, most 
importantly the existence of moduli and the related question of the cosmologi- 
cal constant. At tree level, in the Calabi-Yau solutions, the cosmological constant 
vanishes. But whether this holds in perturbation theory and beyond requires under- 
standing of the quantum theory. 
In studying string theory, we have certain tools: 


(1) weak coupling expansions, 
(2) long-wavelength (low-momentum, «’) expansions. 


We have exploited both of these techniques up to this point. In analyzing string 
spectra, we have worked in a weak coupling limit. There are corrections to the 
masses and couplings, for example, in string perturbation theory, and most of 
the states that we have studied have finite lifetimes. At weak coupling, these ef- 
fects are small, but at strong coupling, the theories presumably look dramatically 
different. 

In asserting that Calabi—Yau vacua are solutions of the string equations, we used 
both types of expansions. We wrote the string equations both in lowest order in 
the string coupling, and also with the fewest number of derivatives (two). Even at 
weak coupling and in the derivative expansion, we can ask whether Calabi—Yau 
spaces are actually solutions of the string equations, both classically and quantum 
mechanically. For example, we have seen that, at lowest order in both expansions, 
there are typically many massless particles. We might expect tadpoles to appear for 
these fields, both in the a’ and in loops. There is in general no guarantee we can 
find a sensible solution by simply perturbing the original one. 

Yet there are many cases where we can make exact statements. In both Type II 
and heterotic string theories, we can often show that Calabi-Yau vacua correspond 
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to exact solutions of the classical string equations. We can also show that they are 
good vacua — there are no tadpoles for massless fields — to all orders of the string per- 
turbation expansion. More dramatically, we can sometimes show that these vacua 
are good, non-perturbative states of the theory. This is perhaps surprising, since 
we lack a suitable non-perturbative formulation in which to address this question 
directly. The key to this magic is supersymmetry. In the framework of quantum 
field theory, we have already seen that supersymmetry gives a great deal of control 
over dynamics, both perturbative and non-perturbative. We were able to prove a va- 
riety of non-renormalization theorems from very simple starting points. The more 
supersymmetry, the more we could establish. The same is true in string theory. 
We can easily prove a variety of non-renormalization theorems for string perturba- 
tion theory. We can show that with N = 1 supersymmetry in four dimensions, the 
superpotential is not renormalized from its tree level form in perturbation theory; 
the gauge coupling functions are not renormalized beyond one loop. These same 
considerations indicate the sorts of non-perturbative corrections which can (and do) 
arise. In theories with more supersymmetries, one can prove stronger statements: 
the superpotential is not renormalized at all, and there are strong constraints on the 
kinetic terms. These sorts of results will be important when we try to understand 
weak-strong coupling dualities. 


27.1 Non-renormalization theorems 


In each of the superstring theories one can prove a variety of non-renormalization 
theorems. Consider, first, the case of ten dimensions. At the level of two derivative 
terms, the actions with N = 1 or N = 2 supersymmetry (16 or 32 supercharges) 
are unique. So, perturbatively and non-perturbatively, there is no renormalization. 
This is a variant of our discussion in four-dimensional field theories. If we di- 
mensionally reduce the Type II theories on a six-dimensional torus, we obtain a 
four-dimensional theory with 32 supercharges (NV = 8 in four dimensions); if we 
reduce the heterotic theory, we obtain a theory with N = 4 supersymmetry in four 
dimensions (16 supercharges). In either case, the supersymmetry is enough to pre- 
vent corrections to either the potential or the kinetic terms, not only perturbatively 
but non-perturbatively. 

These are quite striking results. From this we learn that the question of whether 
the universe is four-dimensional or not, or whether it has, say, four or eight su- 
persymmetries, or none, is not simply a dynamical question (at least in the naive 
sense of comparing the energies of different states or their relative stability). Other 
issues, perhaps cosmological, must come into play. We will save speculations on 
these questions for later. 
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27.1.1 Non-renormalization theorems for world sheet perturbation theory 


Let us turn, now, to compactified theories. Consider, first, a Type II theory com- 
pactified on a Calabi—Yau space. In this case, the low-energy theory has N = 2 
supersymmetry. Again, this is enough to guarantee that there is no potential gen- 
erated for the moduli, perturbatively or non-perturbatively. In other words, starting 
with a solution of the equations of the low-energy effective field theory, at lowest 
order in g, and R*/a’, we are guaranteed that we have an exact solution to all orders — 
and non-perturbatively — in both parameters. 

Now consider the compactification of the heterotic string theory on the same 
Calabi—Yau space, with spin connection equal to the gauge connection. Then the 
world sheet theory, as we saw, has two left-moving and two right-moving super- 
symmetries. It is identical to the theory which describes the corresponding Type 
II background. But we just established that the Calabi—Yau space is a solution of 
the classical string equations, which means that there is a corresponding super- 
conformal field theory with central charge c = 9. This is an exact statement; so 
the background corresponds to an exact solution of the classical string equations. 
This does not establish that the Calabi-Yau space corresponds to an exact vacuum 
quantum mechanically, as it does in the Type II case. For example, the intermediate 
states in quantum loops in the two theories are different. 

We can establish this result in a different way. Consider the h;,; (1, 1)-forms, 
be one of these is the Kahler form, where b;; = g;7. In world sheet perturbation 
theory we have seen that these fields decouple at zero momentum. The fact that 
all scattering amplitudes involving external b particles vanish at zero momentum 
has consequences for the structure of the low-energy effective Lagrangian: only 
derivatives of b appear in the Lagrangian. This is reminiscent of the couplings of 
Goldstone bosons; the Lagrangian, in world sheet perturbation theory, is symmetric 
under 


b(x) > b(x) +a (27.1) 


for constant a. 

This result implies a non-renormalization theorem for o-model perturbation 
theory; b lies in a supermultiplet with r*, the modulus which describes the size of 
the Calabi-Yau space. This is apparent from the fact that they are both Kaluza— 
Klein modes associated with the metric, g;7; r? is the symmetric part; b is the 
antisymmetric part. So this is similar to the situation in which we could prove non- 
renormalization theorems in field theory. Different orders of o-model perturbation 
theory are associated with different powers of r~*. But in holomorphic quantities 
such as the superpotential and gauge coupling function, additional powers of r~? 
are accompanied by powers of b. So only terms which are independent of r~7 are 
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permitted by the shift symmetry. As a result, the superpotential computed at lowest 
order is not corrected in o-model perturbation theory. This means that particles 
which are moduli at the leading order in ew’ are moduli to all orders of o-model 
perturbation theory. 

This non-renormalization theorem does not quite establish that these are good so- 
lutions of the classical string theory; there is still the possibility that non-perturbative 
effects in the o-model will give rise to potentials for the lowest-order moduli. In- 
deed, our argument for the vanishing of the b couplings is not complete. At zero 
momentum, the vertex operator for b, Vp, is topological; while it is the integral of 
a total divergence, it does not necessarily vanish. There generally exist classical 
Euclidean solutions of the two-dimensional field theory — instantons — for which 
the vertex operator is non-zero. These world sheet instantons raise the possibility 
that non-perturbative effects on the world sheet will lift some or all of the vac- 
uum degeneracy. For the (2, 2) theories, however, we already know that this does 
not occur. We earlier argued, by considering the compactification of the related 
Type II theories, that these corresponding sigma models are exactly conformally 
invariant. It is possible (and not terribly difficult) by examining the structure of 
the two-dimensional instanton calculation (““world-sheet instanton”) to show that 
no superpotential is generated. While we will not review this analysis here, the 
techniques involved are familiar from our discussion of four-dimensional instan- 
tons. One wants to determine whether instantons can generate a superpotential. 
One needs, as in four dimensions, to count fermion zero modes, and see if they 
can lead to a non-vanishing correlation function at zero momentum for an appro- 
priate set of fields. In the (2, 2) case, one finds that they cannot. One can then 
ask whether quantum corrections (small fluctuations) to the instanton result can 
yield such a correction. Here, one notes that, as in perturbation theory, holomorphy 
fixes uniquely the dependence on the coupling. So if the lowest-order contribution 
vanishes, higher orders vanish as well. 

In the case of (2,0) compactifications of the heterotic string, the situation is 
more complicated. Perturbatively, we can argue, as before, that solutions of the 
string equations at lowest order are solutions to all orders in the a’ expansion. 
Non-perturbatively, however, the situation is less clear. For such compactifications, 
there is no corresponding Type II compactification, so we can not rely on the magic 
of N = 2 supersymmetry. It is necessary to examine in detail the effects of world 
sheet instantons. In general, if one does the sort of zero-mode counting described 
above, one finds that it is possible to generate a superpotential. But in many cases, 
one can argue that there are cancellations, and the superpotential vanishes. 

It is important to understand that the non-renormalization theorems do not imply 
that the Calabi-Yau manifold is itself an exact solution to the classical string equa- 
tions; rather, the point is that there is guaranteed to exist a solution nearby. There 
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can be — and are — tadpoles for massive particles in o-model perturbation theory. 
A tadpole corresponds to a correction of the equations of motion: 


Vhtmh=l. (27.2) 


This is solved by a perturbatively small shift in the h field: 


r 
aaa ei (27.3) 

m 
For the massless fields, however, one cannot find a solution in this way, and in 
general, if there is a tadpole, there is no nearby (static) solution of the equations. 
This is why the low-energy effective action is such a useful tool in addressing such 


questions: it is precisely the tadpoles for the massless fields which are important. 


27.1.2 Non-renormalization theorems for string perturbation theory 


In field theory, we proved non-renormalization theorems by treating couplings as 
background chiral fields, and exploring the consequences of holomorphy of the 
effective action as functions of these fields. In string theory, we have no coupling 
constants, but the moduli determine the effective couplings, and since they are 
themselves fields, they are restricted by the symmetries of the theory. We exploited 
this connection in the previous section to prove non-renormalization theorems for 
o-model perturbation theory. In this section, we prove similar statements for string 
perturbation theory. 

We begin with the heterotic string theory, on a Calabi-Yau manifold or an orb- 
ifold. In this case, we saw that there is a field, S, which we called the dilaton 
(it is sometimes called the four-dimensional dilaton). The vertex operator for the 
imaginary part of this field, a(x), at k = 0, is simply: 


Vie i do ag X" Ip X Duy. (27.4) 


This is, again, a total derivative on the world sheet. So this particle, which we saw 
earlier is an axion, decouples at zero momentum. So again there is a shift symmetry — 
this is just the axion shift symmetry. Again, this means that the superpotential must 
be independent of S. But since powers of perturbation theory come with powers 
of S, this establishes that the superpotential is not renormalized to all orders of 
perturbation theory! 

As in the world sheet case, there can be non-perturbative corrections to the 
superpotential, and this raises the possibility that potentials will be generated for 
the moduli. We will see shortly that gluino condensation, as in supersymmetric 
field theories, is one such effect. 
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First, we consider other string theories. In the case of Type II compactified 
on a Calabi—Yau space, the N = 2 supersymmetry is enough to insure that no 
superpotential is generated perturbatively or non-perturbatively: Calabi—Yau spaces 
correspond to exact ground states of the theory, and the degeneracies are exact as 
well. As in field theories with N = 2 supersymmetry, corrections to the metric 
(Kahler potential) are possible. Theories with more supersymmetry (heterotic on 
tori or Type II theories on K3 spaces with N = 4 supersymmetry, or Type II on tori 
with eight supersymmetries) are even more restricted. 


27.2 Fayet—Iliopoulos D-terms 


In deriving the non-renormalization theorems for string perturbation theory, we 
established that there is no renormalization of the superpotential, or of the gauge 
coupling function beyond one loop. But this is not quite enough to establish that 
there is no renormalization of the potential. We must also check whether Fayet— 
Tliopoulos terms are generated. From field-theoretic reasoning, we might guess that 
any renormalization would occur only at one loop. In globally supersymmetric 
theories in superspace, a Fayet—Iliopoulos term has the form: 


CD= / d‘ov. (27.5) 


This term is just barely gauge invariant; under V > V + A + A‘, this is invariant 
because d*@.A = Osince A is chiral. If we treat the gauge coupling (or any other 
couplings) as background fields, any would-be corrections to D would have the 
form: 


/ d*69(S, S')V (27.6) 


which is only invariant if g is a constant. Thus any D-term is independent of the 
coupling, in the normalization where 1/g? appears in front of the gauge terms. So 
at most there is a one-loop correction. 

Before going on to string theory, itis interesting to look at the structure of any one- 
loop term. Call the associated U(1) generator Y. If supersymmetry is unbroken, 
massive fields come in pairs with opposite values of Y, so only massless fields 
contribute. The Feynman diagram which contributes to the D-term is shown in 
Fig. 27.1. It is given by: 

5 ake 
c= Tuy) [ Ont (27.7) 
So a vanishing D-term requires that the trace of the U(1) generator vanish. The 
one-loop diagram is quadratically divergent, but let’s rewrite this in a way which 
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Fig. 27.1. The Feynmann diagram which contributes to the D-term. 


resembles expressions we have seen in string theory. We can introduce a “Schwinger 
parameter,’ which we will call t2. Then: 


2 i d*k —2n tok” 
C = 2nTr(Y) é dt Qn) (27.8) 


1 lo) dt 1/2 
= —— Tr(Y = dq. 
32x ( rf ty be : 


We have written things in this way because we want to think think of this as an 
integral over the modular parameter of the torus. At this stage, the integral is still 
quadratically divergent. But, under modular transformations, the complex t plane 
is mapped into itself several times. We can define a fundamental domain, 


1 1 

=F7 2 SS 
2 2 

If we restrict the integration to the fundamental domain, the result is finite. In string 

theories, this turns out to be the correct answer: 

1 
= —Tr 
1927? 


This result can be derived by a straightforward string computation. But instead, 


es ea (27.9) 


aa y (27.10) 


in string models where Tr(Y) is non-zero, we can give a low-energy field theory 
argument which completely fixes the coefficient of the D-term, and also sheds light 
on possible perturbative corrections. If Tr(Y) 4 0, the low-energy theory has a 
gravitational anomaly. This anomaly is rather similar to the gauge anomalies we 
have discussed in field theory. It arises from a diagram with one external gauge 
boson and an external graviton. String models with such anomalies typically have 
gauge anomalies as well, which we can readily evaluate. As an example, consider 
the compactification of the O(32) heterotic string on a Calabi-Yau space, with 
spin connection equal to the gauge connection. In this case, the low-energy gauge 
group is SO(26) x U(1). There are h,,; 26s with U(1) charge 1, and h2,; 26s with 
U(1) charge —1. There are also corresponding singlets, with charge +2 and —2 
respectively. These are in precise correspondence with the fields we found in E¢; 
the 26s arise in parallel to the O(10) 10s; the singlets to the O(10) singlets. But 
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now it is clear that there are anomalies in the gauge symmetries. For example, there 
is aU(1) x O(26)* anomaly proportional to 


A = (ho, — 41) (27.11) 
and a U(1)° anomaly: 
A’ = (ho, — hy,1)(26 — 8). (27.12) 


On the other hand, this is a modular invariant configuration of string theory, so 
there should not be any inconsistency, at least in perturbation theory. So something 
must cancel the anomaly. The cancellation is actually a variant of the mechanism 
discussed originally by Green and Schwarz in ten dimensions, now specialized to 
four dimensions. We know that there is a coupling: 


/ d’aswe. (27.13) 


This gives rise to a coupling of the axion to the FF terms of each group. The 
anomaly calculation in the low-energy theory implies a variation of the action 
proportional to the anomaly coefficient and F F’. So if the axion transforms under 
the gauge symmetry as 


a(x) > a(x) + ca(x) (27.14) 


this can cancel the anomaly. It is crucial that the anomaly coefficients are the same 
for each group. 

We can check whether this hypothesis is correct. If a(x) transforms, then it must 
couple to the gauge field. The required covariant derivative is 


1 
D,a = d,a — —A,. (27.15) 
c 


So from the kinetic term in the action, there is a coupling of A, to a. One can 
compute this coupling without great difficulty and verify that it has the required 
magnitude. 

More interesting, however, is to consider the implications of supersymmetry. We 
can generalize the coupling above to superspace. The transformation law for a now 
becomes a transformation law for S: 


SSH Naa! (27.16) 


where A is the chiral gauge transformation parameter. The gauge-invariant action 
for S is: 


_ [4 ;_l 
doin{S+st—-v). (27.17) 
Cc 
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If we Taylor series expand this Lagrangian, we see that, in addition to the A,,d“a 
coupling, we generate a Fayet—Iliopoulos D-term: 


1 
4 


One can verify that this term — and the other terms implied by this analysis — are 
present. First, we can ask: at what order in perturbation theory should each of these 
terms appear? To establish this, we need to remember that the standard supergravity 
Lagrangian is written in a frame where M, appears in front of the Einstein term 
in the effective action. In the string frame, it is the dilaton — essentially S — which 
appears out front. If we rescale the four-dimensional metric by: 


Suv > S8uv (27.19) 


then S appears in front of the Lagrangian. With this same rescaling, the “kinetic” 
term, which had an S out front, has S*. The Fayet—Iliopoulos D-term, originally 
has a 1/S out front. Correspondingly, the resulting scalar mass term would be 
proportional to 1/5”. After the metric rescaling, this is independent of S, i.e. in the 
heterotic string theory, the D-term should appear at one loop, in accord with our 
field theory intuition. Similarly, the coupling A,,0”a should appear at one loop, 
while there should be a contribution to the cosmological constant at two loops. All 
of these can be found by straightforward string computations (some of these are 
described in the Suggested reading). 

In essentially all known examples, this one-loop D-term does not lead to super- 
symmetry breaking. There always seem to be fields which can cancel the D-term. 
Consider, again, the O(32) theory. Here we can try to cancel the D-term by giving 
an expectation value to one of the singlets, 1_2. The question is whether this gives 
a non-zero contribution to the potential when we consider the superpotential. The 
most dangerous coupling is a term 1_214+2 involving some other singlet. But such 
terms are absent at lowest order, and their absence to higher orders is guaranteed by 
the non-renormalization theorems. Charge conservation forbids terms of the form 
1” ,; there are no other dangerous terms. So this corresponds to an exact “F-flat” di- 
rection of the theory. So in perturbation theory there exists a good vacuum. While a 
general argument is not known, empirically this possibility for cancellation appears 
to arise in every known example. 

What does the theory look like in this new vacuum? 


(1) Supersymmetry is restored and the vacuum energy vanishes. 

(2) The U(1) gauge boson has a mass-squared of order g? times the string scale. 

(3) The longitudinal mode of the gauge boson is principally the imaginary part of the 
charged scalar field whose vev canceled the D-term. There is still a light axion. 
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From the perspective of a very-low-energy observer, the D-term is not a dramatic 
development. It plays some role in determining physics at a very-high-energy scale 
(albeit not quite as high as the string scale). What is perhaps most impressive is 
the utility of effective field theory arguments in sorting out a microscopic string 
problem. Prior to the discovery of the D-term, for example, there had been many 
papers “proving” a strict non-renormalization theorem for the potential; this, we 
see, is not correct (it is not hard to determine, in retrospect, what went wrong in the 
original proofs). The effective field theory arguments make clear when the potential 
is renormalized in perturbation theory and when it is not. They also permit easily 
finding the “new vacuum” in cases where a Fayet—Iliopoulos term appears. It is 
possible, in principle, to find this vacuum by stringy methods, but this is distinctly 
more difficult. Finally, these arguments give insight into the non-perturbative fate 
of the non-renormalization theorems. 


27.3 Gaugino condensation 


We have seen that in string theory, if supersymmetry is unbroken at tree level, it 
is unbroken to all orders of perturbation theory. The argument, as in field theory, 
allows exponential dependence on the coupling. In the case of the heterotic string 
compactified on a Calabi—Yau space, gaugino condensation, as in supersymmetric 
field theories, generates a superpotential on the moduli space. 

Consider the Eg x Eg theory compactified on a Calabi—Yau space, with spin 
connection equal to the gauge potential, and without Wilson lines. In this case, 
there is an Eg x Eg gauge symmetry. There are typically several fields in the 27 
of E¢, but there are no chiral fields transforming in the Eg. So one has a pure Eg 
supersymmetric gauge theory. The couplings of the E~ and Eg are equal at the 
high scale, so the Eg coupling becomes strong first. This leads, as we have seen, to 
gaugino condensation. We have also seen that at tree level there is a coupling: 


Swe. (27.20) 
Just as before, this leads to a superpotential for S: 
W(S) = AeW?5/, (27.21) 


One often hears this described as a “field theory analysis,” as if itis not necessarily 
a feature of the string theory. But string theory obeys all of the principles of quantum 
field theory. If we correctly integrate out high-energy string effects, the Jow-energy 
analysis is necessarily reliable. So the only question is: are there terms in the low- 
energy effective action that lead to larger effects. One might worry that, since we 
understand so little about non-perturbative string theory, it would be hard to address 
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this. But with some very mild assumptions, we can establish that the low-energy 
effects are parametrically larger than any high-energy string effects. 

The basic assumption is that, as in field theory, non-perturbatively the theory 
obeys a discrete shift symmetry (for a suitable normalization of a): 


a(x) > a(x) + 27. (27.22) 


When we discuss non-perturbative string theory, we will give some evidence for 
this; this will turn out to be one of the more mild assertions in the subject of string 
duality. For now, note that if we accept this, any superpotential for S arising from 
high-energy string effects is of the form 


Wie Ge (27.23) 


for integer n. So such effects are exponentially smaller than gaugino condensation. 

What does the low-energy theory look like? The dilaton potential goes rapidly 
to zero for large S, i.e. in the weak coupling limit. We might have hoped that 
somehow we would find supersymmetry is broken and the moduli are fixed. But 
instead, gaugino condensation leads to a runaway potential. At large S$, we have 
just argued that no additional stringy effects can stabilize this behavior. 

We can imagine more elaborate versions of this phenomenon, involving matter 
fields as well, in some sort of hidden sector. But it is difficult to construct models 
where the moduli are stabilized in any controlled fashion along these lines. 


27.4 Obstacles to a weakly coupled string phenomenology 


We have seen that string theory is a theory without dimensionless parameters. This 
is an exciting prospect, but it also raises the question: how are the parameters of 
low-energy physics determined? We have argued that the answer to this question 
lies in the dynamics of the moduli: the expectation values of these fields determine 
the couplings in the low-energy Lagrangian. 

In non-supersymmetric string configurations, perturbative effects already lift the 
degeneracy among different vacua, giving rise to a potential for the moduli. In 
the previous section, we have learned that in supersymmetric compactifications 
non-perturbative effects generically lift the flat directions of the potential. In other 
words, the moduli are not truly moduli at the quantum level. At best, we can speak of 
approximate moduli in regions of the field space where the couplings are weak. The 
potentials, both perturbative and non-perturbative, all tend to zero at zero coupling. 
This is not surprising; with a little thought, it becomes clear that this behavior is 
not specific to perturbation theory or some particular non-perturbative phenomenon 
such as gaugino condensation. At very weak coupling, we expect that the potential 
always tends rapidly to zero. This means that if the potential has a minimum, this 
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occurs when the coupling is not small. This is troubling, for it means that it is likely 
to be hard —if possible at all — to do computations which will reveal detailed features 
of the state of string theory (if any) which describes the world we see around us. 

In the next chapter, we will see that much is known about non-perturbative string 
physics. Most striking are a set of dualities, which relate regimes of very strong 
coupling in one string theory to weak coupling in another. While impressive, these, 
by themselves, do not help with the strong coupling problem we have elucidated 
above. If, at very strong coupling, the theory is equivalent to a weakly coupled 
theory, the potential will again tend to zero. In other words, it is likely that stable 
ground states of string theory exist only in regions where no approximation scheme 
is available. 

Perhaps just as troubling is the problem of the cosmological constant. Neither 
perturbative nor non-perturbative string theory seems to have much to say. The 
potentials are more or less of the size one would guess from dimensional analysis 
(and the expected dependence on the coupling). Perhaps most importantly, they are, 
up to powers of coupling, as large as the scale set by supersymmetry breaking. 

There are, however, some reasons for optimism. Perhaps the most important is 
provided by nature itself: the gauge and Yukawa couplings of the Standard Model 
are small. Another is provided by string theory. As we will discuss later, there are 
ways in which large pure numbers can arise. These might provide mechanisms to 
understand the smallness of couplings, even in situations where asymptotically the 
potential vanishes. Finally, we will see that there is, at present, only one proposal 
to understand the smallness of the cosmological constant, and string theory may 
provide a realization of this suggestion. 


Suggested reading 


The result that there are no continuous global symmetries in string theory is a 
fundamental one. For the heterotic theory, it appears in Banks and Dixon (1988). 
Non-renormalization theorems for world sheet perturbation theory and issues in 
construction of (0, 2) models are described by Witten (1986) and by Green et al. 
(1987). The non-renormalization theorem for string perturbation theory is described 
by Dine and Seiberg (1986). The space-time argument for the Fayet—Iliopoulos D- 
term appears in Dine et al. (1987c); world sheet computations appear in Atick 
et al. (1987) and Dine et al. (1987a). World sheet instantons are discussed in Dine 
et al. (1986, 1987b); cancellations of instanton effects relevant to (0, 2) theories are 
studied by Silverstein and Witten (1995). 
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Beyond weak coupling: non-perturbative string theory 


In the previous chapter, we were forced to face the fact that string theory, if it de- 
scribes nature, is not weakly coupled. On the other hand, the very formulation we 
have put forward of the theory is perturbative. We have described the quantum me- 
chanics of single strings, and given a prescription for calculating their interactions 
order by order in perturbation theory in a parameter g,. There is a parallel here to 
Feynman’s early work on relativistic quantum theory: Feynman guessed a set of 
rules for computing perturbative amplitudes of electrons. In this case, however, one 
already had a candidate for an underlying description: quantum electrodynamics. It 
was Dyson who clarified the connection. For Abelian theories, the non-perturbative 
theory probably does not really exist, but in the case of non-Abelian gauge theories 
it does. The field theoretic formulation provides an understanding of the underlying 
symmetry principles, and access to a trove of theoretical information. 

A string field theory would be a complicated object. The string fields themselves 
would be functionals of the classical two-dimensional fields which describe the 
string. The quantization of such fields is sometimes called “third quantization.” 
Much effort has been devoted to writing down such a field theory. For open strings, 
one can write relatively manageable expressions which reproduce string perturba- 
tion theory. For closed strings, infinite sets of contact interactions are required. But 
apart from their cumbersome structure, there are reasons to suspect that this is not a 
useful formulation. There would seem to be, for example, vastly too many degrees 
of freedom. At one loop, we have seen that string amplitudes are to be integrated 
only over the fundamental region of the moduli space. Naively a field theory which 
simply describes all of the states of the string would have amplitudes integrated 
over the whole region, and the cosmological constant would be extremely divergent. 
The contact terms mentioned above solve this problem, but not in a very satisfying 
way. 

Despite this, there has been great progress in understanding non-perturbative as- 
pects of the known string theories. Most strikingly, it is now known that all theories 
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with 16 or more supersymmetries are the same. Many tools have been developed 
to study phenomena beyond string perturbation theory, especially D-branes and 
supersymmetry. There exist some cases where non-perturbative formulations of 
string theory are possible, and we will discuss them briefly in this chapter. They 
are technically and conceptually much simpler than string field theory. They have 
a puzzling, perhaps disturbing feature, however: they are special to strings prop- 
agating in particular backgrounds. It is as if, in Einstein’s theory, for each pos- 
sible geometry, one had to give a different Hamiltonian. All of these results are 
“empirical.” They have been developed by collecting circumstantial evidence on a 
case-by-case basis. There is still much which is not understood. In the last chapter, 
we will discuss how this developing understanding might lead to a closer connection 
of string theory to nature. 


28.1 Perturbative dualities 


Before considering examples of weak-strong coupling dualities, we return to the 
large/small radius duality we studied in Section 25.3. Many of the dualities we will 
study have a similar flavor, even though they cannot be demonstrated so directly. 
We saw that there is an equivalence of the heterotic string theory at small radius 
to the theory at large radius. By examining the action of these transformations at 
their fixed points, we saw that these duality symmetries are gauge symmetries. We 
could ask, as well, the significance of duality transformations in the IIA and IIB 
theories. As in other closed strings, in addition to transforming the radii, the duality 
transformation takes: 


ON? Sh SON OX OX, (28.1) 


Because of world sheet supersymmetry, it has the same action on the fermions; 
Ww? > —w°; w°? > w?’. But under this the chirality operator appearing in the GSO 
projector is reversed in sign, i.e. duality interchanges the ITA and IIB theories; the 
small-radius IIA theory is equivalent to the large-radius IIB theory, and vice versa. 
There are other perturbative connections. For example, the compactified O(32) 
heterotic string theory is equivalent to the Eg x Eg theory. 


28.2 Strings at strong coupling: duality 


Duality is a term used in physics to label different descriptions of the same physical 
situation. At the level of perturbation theory, we have learned about five apparently 
different string theories. Based on the perturbative dualities discussed above, we 
see that there are at most three inequivalent string theories, the Type I, Type I, and 
heterotic theories. But it is tempting to ask whether there are more connections. In 
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this chapter, we will see that all of the known string theories are equivalent in a 
similar way, but these equivalences relate small and large coupling. For example, the 
strong coupling limit of the O(32) heterotic string theory is the weak coupling limit 
of Type I string theory; the strongly coupled limit of Eg x Eg, compactified to six 
dimensions on a torus, is the weakly coupled limit of Type II theory compactified on 
a K3 manifold (K3 manifolds are essentially four-dimensional Calabi—Yau spaces); 
the ten-dimensional Type II theory is self-dual, and, perhaps most intriguingly of all, 
the strong coupling limit of Type IIA theory in ten dimensions is described, at low 
energies, by a theory whose low-energy limit is eleven-dimensional supergravity. 

Lacking a non-perturbative formulation of the theory, the evidence for these 
connections is necessarily circumstantial. While circumstantial, however, it is com- 
pelling. All of the evidence relies on supersymmetry. We will not be able to review 
all of this here, but will try to give the flavor of some of the arguments. Supersym- 
metry, especially supersymmetry with 16 or 32 supercharges, allows one to write 
a variety of exact formulas, for Lagrangians (based on strong non-renormalization 
theorems) and for spectra (based on BPS formulas) which can be trusted in 
both weak and strong coupling limits. This allows detailed tests of the various 
dualities. 


28.3 D-branes 


When we discussed strong—weak (electric—magnetic) dualities in field theory, topo- 
logical objects played a crucial role. The same is true in string theory, where the 
solitons are various types of branes. In general, a p-brane is a soliton witha p + 1- 
dimensional world volume, so a 0-brane is a particle, a 1-brane a string, a 2-brane a 
membrane, and so on. In general, one might construct these by solving complicated 
non-linear differential equations. But a large and important class of topological ob- 
jects can be uncovered in string theory in a different — and much simpler — way. 
These are the D-branes. These branes fill an important gap in our understanding 
of the Type I and Type II theories. In these theories, we encountered gauge fields 
in the Ramond—Ramond sectors: two-forms in Type I, one-forms and three-forms 
in the ITA theories, zero-forms, two-forms, and four-forms in the IIB. One natural 
question is: where are the charged objects that couple to these fields? They are 
not within the perturbative string spectrum. The vertex operators for these fields 
involved the gauge-invariant field strengths only, so in perturbation theory there 
are no objects with minimal coupling. The answer is the D-branes. Their masses 
(tensions) are proportional to 1/g,, so at weak coupling they are very heavy. This 
is why they are not encountered in the string perturbation expansion. 

When we discussed open strings, we noted that there are two possible choices 
of boundary conditions: Neumann and Dirichlet. At first sight, Neumann boundary 
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conditions appear more sensible; Dirichlet boundary conditions would violate trans- 
lational invariance, implying that strings end at a particular point(s). But we have 
already encountered violations of translational invariance within translationally 
invariant theories: solitons, such as magnetic monopoles or higher-dimensional ob- 
jects like cosmic strings and domain walls. Admitting the possibility of Dirichlet 
boundary conditions for some or all of the coordinates leads to a class of topologi- 
cal objects known as D-branes (for Dirichlet branes). If d — p — 1 of the boundary 
conditions are Dirichlet, while p + 1 are Neumann, the system is said to describe 
a Dp-brane. 

We can be quite explicit. Start first with the bosonic string. For the Neumann 
directions, we have our previous open string mode expansion of Eq. (21.16). For 
the Dirichlet directions, we have: 


1 ; 
Xiaxgti>) ane sinino) I=1,d—p-—1. (28.2) 
n#0 


Note that there are no momenta associated with the Dirichlet directions. The xs 
should be thought of as collective coordinates. We will argue shortly that the tension 
of the branes is proportional to M? v /&s- 

Consider an “extreme” case, that of a DO-brane. There are 25 collective coor- 
dinates and no momenta, so this object is a conventional soliton. In field theory, 
the excitations near the soliton, which describe scattering of mesons (field theory 
excitations) from the soliton must be found by studying the eigenfunctions of the 
quadratic fluctuation operator. But here they are very simple: they are just the ex- 
citations of the open string. As a second example, consider a D3-brane. Now the 
momentum has four components. So the excitations which propagate on the brane 
are four-dimensional fields. These break up into two types. The Neumann fields, 
X", give rise to a massless gauge boson, the state alt ,|0); the Dirichlet fields, X q 
give rise to massless scalars on the brane a! ,|0). In the superstring version of this 
construction, there are six scalars and a gauge boson, and their superpartners. In 
N = 1 language, this is a vector multiplet and three chiral multiplets, the content 
of N = 4 Yang-Mills theory with gauge group U(1). 

Before considering some of these statements in greater detail, let us explore a 
further aspect of this construction. Suppose we have several branes, say D3-branes, 
parallel to each other. Here parallel just means that the strings which end on these 
branes have Dirichlet or Neumann boundary conditions in the same direction. Now, 
however, we have the possibility that the strings end on different branes. Take the 
simplest case of two branes. If the branes are separated by a distance r, in addition 
to the modes above, labeled by the collective coordinate ae ,1 = 1,2, we have to 
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allow for expansions of the form: 


X!(0,t) = xf to—(xi — x?) +i SS bgt ane sinfno) JT=1,...,d—p-—l. 
a poral 

(28.3) 
There are two such configurations, one starting on the first brane and ending on the 
second, and one starting on the second and ending on the first. The ground states 
in these sectors have mass-squared proportional to r?. For r ¥ 0, all of these states 
are massive. The massless bosons consist of a U(1) gauge boson on each brane, 
as well as scalars. As r — 0, we have two additional massless gauge bosons. If 
we generalize to n branes, we have n massless gauge bosons and 6n scalars; as we 
bring the branes close together, we have n* gauge bosons and 6n? scalars. 

There is a natural conjecture as to what is going on here. When all of the branes 
are on top of one another, we have a U(n) gauge symmetry, with 3n complex 
scalars transforming in the adjoint representation of the group. As the branes are 
separated, the adjoint scalars acquire (commuting) expectation values; these break 
the gauge symmetry to U(1)”, giving mass to the other gauge bosons. In principle, 
we would like to check that these n* gauge bosons interact as required for Yang— 
Mills theories, as we did for the gauge bosons of the heterotic string. This is more 
challenging here, since we need vertex operators which connect strings ending on 
different branes and we will not attempt this. We will provide further evidence for 
the correctness of this picture shortly. 

The branes break some of the supersymmetry of the Type II theory in infinite 
space; instead of 32 conserved supercharges, there are 16. A simple way to un- 
derstand this uses the light cone gauge construction. There are now open strings 
ending on the brane. For the world sheet fermions, the boundary conditions relate 
the left and right movers on the string. Calling these S, and S,, we have 


ioc = So Ore Os. Saya yee ee (28.4) 
Recall that half of the supercharges have the very simple form: 
OQ = / doS? Q%= / do S“ (28.5) 


so Q* = Q". This is the structure of a broken supersymmetry generator, with S the 
goldstino. The same is true for the other set of supercharges. Other configurations 
of branes, such as non-parallel sets of branes, preserve less supersymmetry. Brane— 
anti-brane configurations preserve no supersymmetry at all. 

We can imagine other sets of branes, which would respect different amounts of 
supersymmetry. If we have branes which are not parallel, for example, different sets 
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of supersymmetries will be preserved. In order to count supersymmetries, we need 
to compare the supersymmetries on different branes, at different angles relative to 
one another. 


28.3.1 Brane charges 


We have seen that the simplest D-brane configurations preserve half of the super- 
symmetries. In other words, they are BPS states. Typically BPS states are associated 
with conserved charges. In the case of IIA and the IIB theories, in the Ramond— 
Ramond sectors there are gauge fields, but, in perturbation theory, no charged 
objects. Polchinski guessed — and showed — that the objects which carry Ramond— 
Ramond charges are D-branes. In the ITA case, the gauge fields are a one form anda 
three form; in the IIB case they are a zero-form, a two-form, and a (self-dual) four- 
form. In relativistic mechanics, a gauge field couples to a particle — a zero-brane. 
We have seen that a two-index tensor couples naturally to a string — a one-brane. 
So this suggests that in the ITA theory, there should be Dp-branes with p even, cou- 
pling to the corresponding R-R gauge fields, while in the IIB theory there should be 
Dp-branes with p odd. Polchinski verified this by direct calculation. He computed 
the one-loop amplitude for two separated branes. For large separations, he found 
the poles associated with exchange of the massless gauge fields (more precisely, 
for fixed separation, r, one should see falloff with powers of 1/r). His calculation 
not only yields the brane charges, but it also gives the brane tensions. 

Consider the case of two branes, separated by a distance y. In empty flat space, 
the trace over states in the one-loop amplitude for open strings gives a result of the 
form: 

[oe 
A=cf S. (28.6) 
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The power of f arises from the momentum integral, { d 8k exp(—k*), as well as 
from the manipulation of the oscillator traces. The main difference in the case of 
two separated branes is that the mass-squared has a contribution from the brane 
separation, 9, and 9 — p coordinates of the brane are fixed, so they don’t have 
associated momenta. So the result has the form: 


at ty? 
A= c| 3 Brat) Pexp (- 2 ) 
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Here G,(y) is the scalar Green function in d dimensions. So one can think of a 
potential between the branes associated with the exchange of massless states. These 
massless states are antisymmetric tensor fields and their superpartners, as well as 
gravitons and gravitinos. These contributions can be isolated, and the tensions and 
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charges of the D-branes determined. In the case of the superstring, the full potential 
vanishes due to boson and fermion cancellations. 


28.3.2 Brane actions 


We are familiar with the actions for zero-branes and one-branes. The action for a 
general p-brane is a generalization of these: 


2 4 axMax? \ 
= Tf | a? & de (= aE a . (28.8) 


In the zero-brane case, this is the action for a particle; X“(t) is the collective 
coordinate which describes the position of the soliton, and 7p is its mass. For a 
general background with a bulk metric, dilaton and antisymmetric tensor field this 
generalizes to 


Sp =-T, i: d?*'ge~®[— det(Gap + Bay + 20a Fyy)]'/?. (28.9) 


The terms involving the metric and antisymmetric tensor are similar to those we 
have encountered elsewhere in string theory, and their form is not surprising. The 
e~° reflects the fact that in the open string sector, the coupling constant is the square 
root of that of the closed string sector. 


28.4 Branes from 7-duality of Type I strings 


There is another way to think about D-branes, which provides further insight. We 
have seen that closed string theories exhibit a duality between large and small radius. 
In the heterotic theory there is an exact equivalence of the theories at large and small 
radius, which can be understood as a gauge symmetry. In Type II theories, 7'-duality 
relates two apparently different theories. It is natural to ask what is the connection 
between large and small radius in theories with open strings. Open strings have 
momentum states, but no winding states. So there cannot be a self-duality. Instead, 
we look for an equivalence between the open string theory at one radius and some 
other theory at the inverse radius. Here we uncover D-branes. 

Consider the boundary conditions on the strings in the compactified direction. 
For the closed string fields, the effect of the duality transformation is to take: 


X, _—> X, XR —> —Xp. (28.10) 


In terms of left- and right-moving bosons in open string theories, Neumann bound- 
ary conditions are the conditions 


0,X = (05, + dg_)X = 0. (28.11) 
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So after a T-duality transformation, we would expect 
(0c, — 06_)X = 0,X = 0, (28.12) 


i.e. we have traded Neumann for Dirichlet boundary conditions. While this follows 
from simple calculus manipulations, it is instructive to formulate this in terms of 
the mode expansion for the open string. Ordinarily, we have: 


1 1 1 ; : 
xX? =x? + —p(t+o)+—p(t—o)+i > = (ape EF) 4 Qe iM), 
2 2 nm” 
(28.13) 


The effect of the duality transformation is to change the sign of the terms which 
depend on t — o. So instead of writing an expansion in terms of cosines, we have 
an expansion in terms of sines: 


Xo = x3 + po +i 3 EP ry (28.14) 
n#0 Mt 
These are precisely the Dirichlet branes. Note the role of p: in the 7-dual picture, it 
is a sort of winding: it describes strings which start on the brane, wind around the 
compact dimension some number of times, and then end on the brane. 

This T-duality of open strings also allows us to better understand the appearance 
of gauge interactions associated with stacks of branes. In the original open string 
picture, gauge degrees of freedom are described by Chan—Paton factors, i.e. charges 
on the ends of the string. In the case of Type I strings, these are described by states 
of the form |AB), A, B =1,...,32. Now consider a U(16) subgroup of O(32). 
The string ends carry labels, i, 7, within U(16). Taking the diagonal generators of 
U(N) to be the matrices 


T,; = diag(1,0,0,...) 7) = diag(O,1,0,...), (28.15) 


etc., the state (7, j) carries charge —1 under 7;, +1 under 7;, and zero under the 
other generators. 

We can consider constant, background gauge fields in the 9 direction. We can 
write these as: 


A = diag(d, dz, ..., G16). (28.16) 
This has a gauge-invariant description in terms of the Wilson line: 
U = ei f dea, (28.17) 


where the integral is taken in the periodic directions. Such a background gauge field 
breaks the gauge symmetry to U(1)!°, in general; the other gauge bosons should 
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gain mass. In field theory, the corresponding mass terms are proportional to 
[A”, A’? (28.18) 


so the diagonal gauge bosons are massless, and those corresponding to the non- 
Hermitian generator 


ibis = 575, (28.19) 
have mass-squared 
m? = (a; — aj)’. (28.20) 


This is similar to the calculations we did of symmetry breaking in grand unified 
theories. 

We would like to understand how this result arises directly in the string theory. 
It is simplest to consider the case of a string which is constant in o. The coupling 
of the string depends on the Chan—Paton factors. In the light cone, the action in the 
presence of a gauge field is like that of a particle: 


lt eg es OE ae 
>| T (=) BC aS em : (28.21) 


For a non-constant string, the situation is somewhat more complicated, since the 
gauge fields couple at the string end points. 
The extra term modifies the canonical momenta. These are now: 
n ax? 


P=Zaa7tai—ay). (28.22) 


This means that the leading term in the string mode expansion is: 


xO= (= Zafgetes a;)) t. (28.23) 


This gives an extra contribution to the mass. If = 0, this is exactly what we expect 
from field-theoretic reasoning. 

Now let’s consider the 7-dual picture. Under T-duality, the zero-mode part of 
X transforms into: 


Sg (= — (a; - aj) 0. (28.24) 


For i = j, this corresponds to a string beginning and ending on the same D-brane. 
Fori  j, the string ends at different points, i.e. on separated D-branes. At least for 
the Type I theory, we have derived the picture we conjectured earlier: a stack of N 
coincident branes describes a U(N) gauge symmetry; as the branes are separated, 
the gauge symmetry is broken by a field in the adjoint representation. 


450 28 Beyond weak coupling: non-perturbative string theory 


28.4.1 Orientifolds 


We have seen that we can understand the appearance of D-branes by considering 
T-duality transformations of open strings. The Type I theory is a theory of oriented 
strings. In the closed string sector, the action has a parity symmetry, which inter- 
changes left and right on the world sheet. Calling the corresponding operator (2, one 
keeps only states which are invariant under the action of (2. This is necessary for 
the consistency of interactions of open and closed strings. This means that closed 
string states like 


a_2@_1G_1|0) (28.25) 
are not allowed, but symmetrized combinations such as 


a 20 1a 1+a@ 201 1|0) (28.26) 


are permitted. This projection is similar to the orbifold projections we have encoun- 
tered earlier. 

Consider the action of & in the T-dual theory. We have seen that in terms of the 
original fields, 


X” = =X? + Xp. (28.27) 


So the effect of interchanging left and right is to change the sign of X°, ie. Qisa 
combination of a world sheet parity transformation and a reflection in space-time. 

The effect of this projection on states is similar to a Z2 orbifold projection. We can 
combine momentum states to form states with definite transformation properties 
under the reflection: 


Ip) +| — p). (28.28) 


Gravitons, for example, in the non-compact directions, G,,,, must have momentum 
states which are even; in coordinate space, this means that graviton states must be 
even functions of x. The fields G,,9 must be odd functions, and so on. It is as if there 
is an entity, the orientifold, sitting at the origin — the fixed point of the reflection. 
This object in fact has a negative tension. One way to see this is simply to note 
that the effect of the 7-duality transformation was to produce a set of D-branes. 
These branes have a positive tension. From the point of view of the non-compact 
dimensions, this is a cosmological constant. But the original theory had no such 
cosmological constant — this must be canceled by the orientifold. 

Just as it is not necessary to start from the Type I theory and its dualities to 
encounter D-branes, it is not necessary to start from the Type I theory to consider 
orientifolds. Starting from Type II theories, in particular, we can perform a pro- 
jection by world sheet parity times some Z, space-time symmetry. For example, 
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consider a Type II theory with a single compact dimension. On this theory, we can 
make a projection which is a combination of world sheet parity, Q, and reflection 
in the compact dimension. 


28.5 Strong—weak coupling dualities: the equivalence of 
different string theories 


We have seen that at weak coupling, there are a variety of connections between 
different string theories which are surprising from a field-theoretic perspective. 
The heterotic string, compactified on a circle of very large radius, is equivalent to 
a string theory compactified at very small radius (with a different coupling). The 
Type ITA theory at large radius is equivalent to the IIB theory at small radius. The 
O(32) heterotic string is equivalent to the Es x Eg theory. All of these equivalences 
involve significant rearrangement of the degrees of freedom. Typically Kaluza— 
Klein modes, which are readily understood from a space-time field theory point of 
view, must be exchanged with winding modes, which seem inherently “stringy.” So 
perhaps it is not surprising that there are other equivalences, involving weak and 
strong coupling. Again, we have had some inkling of this in field theory, when we 
studied N = 4 Yang—Mills theory. There, the theory at weak coupling is equivalent 
to a theory at strong coupling. To see this equivalence, one needs to significantly 
rearrange the degrees of freedom. States with different electric and magnetic charge 
exchange roles, as the coupling is changed from strong to weak. 

In string theory, there is a complex web of dualities. The IIB theory in ten 
dimensions exhibits a strong—weak coupling duality very similar to that of N = 4 
Yang—Mills theories; weak and strong coupling are completely equivalent. The 
O(32) heterotic string theory, in ten dimensions, is equivalent at strong coupling 
to the weakly coupled Type I theory. These relations are surprising, in that these 
theories appear to involve totally different degrees of freedom at weak coupling. 
But there are more surprises still. The strong coupling limit of the ITA theory in ten 
dimensions is a theory whose low-energy limit is eleven-dimensional supergravity. 
If we allow for compactifications of the theory, this set of dualities is already enough 
to establish an equivalence of all string theories, as well as some as yet not fully 
understood theory whose low-energy limit is eleven-dimensional supergravity. But 
as we compactify, we find further, intricate relations. For example, the Type IIA 
theory on K3 is equivalent to Eg x Eg on T+. Given that all of the sensible theories 
of quantum gravity we know are equivalent, it is plausible that, in some sense, there 
is a unique theory of quantum gravity. As we will see, however, we only know 
this reliably for theories with at least 16 supercharges. For theories with four or 
less, this situation is less clear; it is by no means obvious that the statement is even 
meaningful. 
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In the next sections, we will explore some of these dualities, and the evidence 
for them. We will also discuss two particularly surprising equivalences. We will 
argue that certain string theories are equivalent to quantum field theories — even 
to quantum mechanical systems. The very notion of space-time in this framework 
will be a derived concept. 


28.6 Strong—weak coupling dualities: some evidence 


In the case of T-dualities, those dualities which relate the behavior of string theories 
at weak coupling and different radii, it is straightforward to understand the precise 
mappings between the different descriptions. Lacking a general non-perturbative 
definition of string theory, it is not possible to do something similar in the case 
of strong—weak coupling dualities. Instead, one can try to put together compelling 
circumstantial evidence. Without supersymmetry, even this is essentially impos- 
sible. But in the presence of sufficient supersymmetry one has a high degree of 
control over the dynamics. Evidence for equivalence can be provided by studying 
the following. 


(1) The effective action: in ten or eleven dimensions, the terms in the action with up to two 
derivatives are uniquely determined by supersymmetry, so they are not corrected either 
perturbatively or non-perturbatively. A similar statement holds for N > 4 actions in 
four (and actions with varying degrees of supersymmetry in between). In some cases, 
one can check higher-derivative terms in the effective action as well. 

(2) The spectrum of BPS objects: in many cases, the low-lying states are BPS objects. They 
cannot disappear from the spectrum as the coupling or other parameters are varied. With 
16 or more supercharges, they obey exact mass formulae. The identity of the BPS states 
for different theories provides non-trivial evidence for these equivalences. 


We will explore only some of the simplest connections here, but it is important 
to stress that these identifications are often subtle and intricate. In many instances 
where one might have thought the dualities mentioned above might fail, they do 
not. 


28.6.1 ITA — eleven-dimensional supergravity (M theory) 


We start with the IIA theory, where we can readily access both aspects of the 
duality. Comparing the actions of eleven-dimensional supergravity and the IIA 
theory is particularly straightforward, as the Lagrangian of the IIA theory is often 
obtained by compactifying eleven-dimensional supergravity on a circle, keeping 
only the zero modes. The basic degrees of freedom in eleven dimensions are the 
graviton, gyw, the antisymmetric tensor gauge field, Cyyo, and the gravitino, yy. 
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We are not going to work out the detailed properties of this theory, but it is a useful 
exercise to check that the numbers of bosonic and fermionic degrees of freedom 
are the same. As usual, we can count degrees of freedom by going to the light 
cone (or using the “little group,” the group of rotations in D = 11 — 2 = 9). The 
metric is asymmetric, traceless tensor; for the gravitino, we need also to impose the 
constraint y/wW; = 0. For the metric, then, we have ((9 x 10)/2) — 1 = 44, while 
from the three-index antisymmetric tensor we have (9 x 8 x 7)/3! = 84, fora total 
of 128 bosonic degrees of freedom. From the gravitino, we have 9 x 16 — 16 = 128 
degrees of freedom. 

If we compactify x!° on a circle of radius R, we obtain the following bosonic 
degrees of freedom in ten dimensions. 


(1) The ten-dimensional metric, g, (u,v = 0,..., 9). 

(2) From gio, we obtain a vector gauge field. This is identified with the Ramond—Ramond 
vector field of the ITA theory. 

(3) From Cjo,,, we obtain an antisymmetric tensor field, identified with the antisymmetric 
tensor, B,,, of the NS—NS sector of the IIA theory. 

(4) From C,,,,, we obtain the three-index antisymmetric tensor field of the R—-R sector of 
the IIA theory. 

(5) From gio,190 we obtain a scalar field in ten dimensions, the dilaton of the IIA theory. 
Note that this mode corresponds to the radius, R, of the eleventh dimension. 


Now consider the action. We will examine just the bosonic terms. These are 
constructed in terms of the curvature tensor, the three-index antisymmetric tensor, 
and its corresponding four-index field strength, F: 


—1 1 
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(28.29) 


AS we indicated, the dimensional reduction of this theory gives the Lagrangian of 
the ITA theory in ten dimensions. It is convenient to parameterize the fields in terms 
of the vielbein, ew. Then: 


A 
Aa(% An 28.30 
emu ( 0 a . ( : ) 
Correspondingly, the metric has the structure: 
_ A,B = Suv Ry A, 
&MN = CyeNNAB = CoA R?, ) , (28.31) 


If we simply plug these expressions into the Lagrangian, the coefficient of the 
Einstein, 7, term, will be proportional to R. In order to bring this Lagrangian to the 
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canonical, Einstein form, it is necessary to perform a Wey] rescaling of the metric. 
But instead, we will perform the rescaling so as to bring the action to “string frame.” 
In this frame, all of the NS—NS fields have a factor of e~2% out front, where e~2% 
is the string coupling (dilaton). In ten dimensions, ,/g = e transforms like (uv)? 
under an overall rescaling of the metric; ? transforms like (ea) So we need to 
rescale: 


Swe RY Bde (28.32) 


The three form, C, upon reduction, leads to various fields in ten dimensions. The 
components Cj9,,, give the NS—NS two-form. The fields C,,,, give the R—R three- 
form. The R-R one-form field arises from the g0,,, components of the metric. The 
ten-dimensional action becomes: 


S = Sys + Sp (28.33) 


with 


Sws = ; / dea e (r + (Vo) — >) (28.34) 


1 1 1 


We have seen that, when the action is written this way, R is related to the coupling 
of the ten-dimensional string theory. The Weyl rescaling, g,, — ed Tek gives 
an action with R°? out front, i.e. 


Via “83 9 (dR 
L=R? (-32- mes Ee al e “) ) (28.36) 
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In this form, the unit of length is the string scale, £,. So loops come with a factor 
of Re, (the ultraviolet cutoff is €> '). So we see that 


(28.37) 


We can derive this relation another way (not keeping 27s), which makes a 
more direct connection between eleven-dimensional supergravity and strings. The 
eleven-dimensional theory has membrane solutions. We will not exhibit these here, 
but this should not be too surprising: the three-form, Cywo, couples naturally to 
membranes. The eleven-dimensional theory has only one scale, ¢);, so the tension 
of the membranes is of order ve . We can wrap one of the coordinates of the 
membrane around the eleventh dimension. If the eleventh dimension is very small, 
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the result is a string propagating in ten dimensions, with a tension: 
Palka. (28.38) 


Now, again, the ten-dimensional gravitational coupling is related to £,; by 


at 
Gi = —. 28.39 
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So we have our first piece of circumstantial evidence for the connection. Let’s 
turn now to the BPS spectrum. Consider, first, the eleven-dimensional supersymme- 
try algebra. Eleven-dimensional spinors can be decomposed into ten-dimensional 
spinors of definite chirality, with indices a and dq. In this basis, 


1 
Ti (; : (28.41) 


The eleven-dimensional momenta decompose into ten-dimensional momenta and 
P11 in an obvious way: 


{Qa, Qa} =Bag + Pirdaa- (28.42) 


From a ten-dimensional point of view, the last term is a central charge. In the 
presence of such a central charge, we can prove a BPS bound as we did for the 
monopole. This bound is saturated by the Kaluza—Klein modes of the graviton and 
the antisymmetric tensor field. What charge does this central charge correspond 
to in the IIA theory, and to which states do the momentum states correspond? It 
is natural to guess that this is one of the R-R charges. The simplest possibility 
is the charge associated with the one-form gauge field. The carriers of the one- 
form charge are DO-branes. The DO-branes are BPS states — they preserve half 
of the ten-dimensional supersymmetry. So states of definite eleven-dimensional 
momentum are states of definite D-brane charge. More precisely, localized states 
with N units of Kaluza—Klein momentum correspond to zero-energy bound states 
(so-called threshold bound states) of N D-branes. 

There are numerous further tests of this duality. For example, if one compactifies 
the theory further, there are connections to IIB theory. There are also connections 
involving M5-branes. But this discussion gives some flavor of the duality, and the 
evidence. 
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28.6.2 IIB self-duality 


The IIB theory exhibits an interesting self-duality. We can understand this, first, 
from the Lagrangian. The Lagrangian for the NS-NS fields is the same as for the 
IIA theory. For the R-R fields, we have now zero and two, and four-form fields. 
The Lagrangian for these is similar, with appropriate indices, to that for the R-R 
fields of the IIA case. A careful examination shows that under the transformation 
¢ — —¢ the Lagrangian goes into itself. At the classical level, the action is also 
invariant under shifts of the axion. 

Grouping the dilaton, e?, and the Ramond—Ramond scalar, 6, into a complex 
field, 


Ti (28.43) 


it then is natural to conjecture that the underlying theory has an SL(2, Z) symmetry 
similar to that of N = 4 Yang—Mills theory: 


at+b 
> 
ct+d 


ad — bc =1. (28.44) 


Further evidence for this symmetry is obtained by studying BPS objects: the 
various branes of the theory. In the IIB theory, we have fundamental strings and 
D1-branes; we also have D5-branes. Under this duality, the fundamental strings 
are mapped into D1-branes by the SZ(2, Z) transformations. Correspondingly, 
the H3 form (which couples to fundamental strings) should be mapped into the 
F form (which couples to D1 strings). The D3-branes are associated with the 
gauge-invariant five-form field strength, which is self-dual, so we might expect 
the D3-branes to be invariant. Study of the BPS formulae for these states lends 
support to these conjectures. 

This leaves the D5-branes. These couple to the Ramond—Ramond six-form form 
gauge field, which is associated with a seven-form field strength, which is in turn 
dual to the three form R-R field strength. In other words, the D5-brane is a magnetic 
source for F3. So we might expect these to be dual to something which is a magnetic 
source for the NS three form. This would be an NS 5-brane. Such an object can 
be constructed as a soliton of the ten-dimensional IIB supergravity theory. It plays 
an important role in understanding the duality of these theories. It also appears in 
other contexts. For example, in M theory, it is associated with a seven-form field 
strength, which is dual to the four-form field strength we have already encountered. 
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The M5 solution is: 


&mn = Sinn Suv = Nuv (28.45) 
Linas = SG, Oe (28.46) 
2¢ _ 200) , 
Here jz, v are the coordinates tangent to the brane (they are the world-volume 
coordinates); m,n... are the coordinates transverse to the brane. The SL(2, Z) 


duality of the IIB theory is quite intricate and beautiful. There are many subtle and 
interesting checks. 


28.6.3 Type I—O(32) duality 


The duality between the Type I and O(32) theories is particularly intriguing, as it 
is a duality between a theory with open and closed strings and a theory with closed 
strings only. It is also puzzling since the perturbative spectra of these theories, at 
the level of massive states, are quite different. The O(32) heterotic theory contains 
towers of massive states in spinor representations; there is nothing like this in the 
perturbative spectrum of the Type I theory. By way of evidence, we can begin, 
again, with the effective Lagrangian. For the heterotic theory this can be written 


/ d°xe~9(R + |Vol? + F? + dB’). (28.48) 


Here e~” is the dilaton field, and we have written the action in string frame. 
Consider, now, the transformation: 


g=er?’ b=-¢'. (28.49) 


This takes the action to: 
if dx /g(eF(R + |Vo' |?) + e-* F? + dB’). (28.50) 


This is the action for the bosonic fields of the Type I theory. The closed string fields 
couple with g’, while the open string fields couple with g. In the Type I theory, the 
antisymmetric tensor is an R-R field, and as a result, no factor of the coupling (the 
dilaton) appears out front of its kinetic term. 

Now we can ask: how do the hetorotic strings appear in the open string theory? 
Here, we might guess that these strings would appear as solitons. More precisely, 
these strings are just the D1-branes of the Type I theory. At weak coupling, the 
tension of these strings will behave as 1/g, i.e. it will be quite large. In this sector, 
one can find states in spinorial representations of O(32), arising from configurations 
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Fig. 28.1. The strongly coupled heterotic string is described by an eleven- 
dimensional bulk theory and two segregated walls, on which gauge degrees of 
freedom propagate. 


of D1—D9-branes. Most important, the D1-branes are BPS. As a result, they persist 
to strong coupling, and in this regime their tension is small. We will not explore the 
various subtle tests of this correspondence, but other features one can investigate 
include the identification of the winding strings of the heterotic theory. 

Many other dualities among different string theories have been explored. These 
include an equivalence between heterotic string theory on a four-torus and Type IITA 
on K3, and equivalences of Calabi-Yau compactifications of the Type II theory and 
heterotic theory on K3 x T2. 


28.7 Strongly coupled heterotic string 


In ten dimensions, we have seen that the strong coupling limit of the IIA theory 
is a theory whose low-energy limit is eleven-dimensional supergravity. The strong 
coupling limit of the IIB theory is again the IIB theory. The strong coupling limit of 
the O(32) heterotic string is the Type I string. This still leaves the question: what is 
the strong coupling limit of the Eg x Eg heterotic string? The answer is intriguing. It 
has some tantalizing connections to facts we see in nature. It also suggests different 
ways of thinking about compactifications — inklings of the large extra dimension 
and warped space pictures which we will discuss in the next chapter. 

Horava and Witten recognized that the strong coupling limit of the heterotic 
string, like the ITA theory, is an eleven-dimensional theory. The theory is defined 
on an interval of radius R,,. The relation of R1; to the string tension and coupling 
are exactly as in the ITA case. This means that as the coupling gets large, the interval 
becomes large. We will refer to the full eleven-dimensional space as the “bulk.” The 
fields propagating in the bulk are a full eleven-dimensional supergravity multiplet: 
graviton, gravitino and three-form field. At the end of the interval, there are two 
walls (Fig. 28.1). These walls are similar to orientifolds, in that they are not dynam- 
ical (there are no degrees of freedom corresponding to motion of the walls). The 
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low-lying degrees of freedom on each wall are those of a supersymmetric Eg gauge 
theory: gauge bosons and gauginos in the adjoint representation. The Lagrangian 
has the structure of a bulk plus boundary term: 


I ee | 
S=- 5 dx /gR- > go Ary ji d'°x,/gTrF? +--+. (28.51) 
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Note that the gauge coupling is simply proportional to the sixth power of the eleven- 
dimensional Planck length. 

Support for this picture comes from a variety of sources. First, there is a subtle 
cancellation of gauge and gravitational anomalies. Second, the long-wavelength 
limit of this theory is ten-dimensional gravity plus Yang—Mills theory, with the 
relation between the gauge and gravitational couplings appropriate to the heterotic 
string (this is one way to fix the coupling constants). Further compactifications 
provide further checks. 


28.7.1 Compactification of the strongly coupled heterotic string 


One puzzle in the phenomenology of the weakly coupled heterotic string concerns 
the value of the gauge coupling and the unification scale. In the MSSM, the unifi- 
cation scale is two orders of magnitude below the Planck scale. If we imagine that 
the unification scale corresponds to a scale of compactification, then 


ie 
Aout X i (28.52) 


If we treat the left hand side as fixed, then as V becomes large, so does g,. Plugging 
in the observed values, g, is quite large. As we will now show, the situation in the 
strong coupling limit is much different — and much more promising. 

Consider compactification of the strongly coupled theory on a Calabi—Yau space. 
The full compact manifold, from the point of view of an eleven-dimensional ob- 
server, is the product of the interval times a Calabi-Yau space X. Such a configu- 
ration is an approximate solution of the lowest-order equations of motion. Even at 
the level of the classical equations of this theory, there are corrections arising from 
the coupling of bulk to boundary fields. These can be constructed in a power series 
expansion. Terms in the expansion grow with R,;, owing to the one-dimensional 
geometry in the eleventh dimension. They are proportional to «’/?, from the bulk— 
brane coupling in Eq. (28.51). On dimensional grounds, there is a factor of R~*, 
where r is the Calabi-Yau radius. The expansion parameter is thus 


6 =P Ri /R*. (28.53) 
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We can readily obtain the relation between the four-dimensional and eleven- 
dimensional quantities. Using the string relations (here we will be careful about 
factors of 2 and zr): 


26(,/\4 26/,y/\3 
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where V is the volume of the compact space X, and the eleven-dimensional 
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where R = V!/6, Putting in the “observed” value of ogy, and the four-dimensional 
Planck mass gives: 


RiMy, =18 R= 261; = (3 x 10!) GeV. (28.57) 


The regime of validity of the strongly coupled description is the regime where V 
and Rj; are large compared to £;;. We see that nature might well be in such a 
regime. If we evaluate the expansion parameter €, we find € ~ 1. Adopting the 
viewpoint that the ground state of string theory which describes nature should be 
strongly coupled, this, again, seems promising: the parameters of grand unification 
correspond to the point where the eleven-dimensional expansion is just breaking 
down, € © 1. This is in contrast to the weak coupling picture, which seems far from 
its range of validity. 

Apart from this rather direct phenomenological application of string theory ideas, 
there are two new possibilities which this analysis suggests. First, some compact 
dimensions might be large compared to the Planck scale (or any fundamental scale). 
Second, in a case with a one-dimensional geometry, this dimension can be signifi- 
cantly warped, i.e. the metric need not be a constant. These ideas underlie the large 
extra dimension and Randall—Sundrum models of compactification, which we will 
encounter in the next chapter. 


28.8 Non-perturbative formulations of string theory 


We have seen that, at least in cases with a great deal of supersymmetry, we have a 
surprisingly large access to non-perturbative dynamics. But much of the evidence 
for the various phenomena we have described is circumstantial, matching actions 
and spectra in various regions of a given string moduli space. We lack a general, 
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non-perturbative formulation of the theory, analogous to, say, the lattice formu- 
lations of Yang—Mills theories which we encountered in Part 1. One might have 
hoped that there would be a string field theory, analogous to ordinary quantum field 
theories, but such a program is fraught with conceptual and technical difficulties. 
We have mentioned some of these. In this section, we will describe situations where 
one can give a complete non-perturbative description. These descriptions are spe- 
cific to particular backgrounds: flat space in higher dimensions, and certain AdS 
spaces. In eleven dimensions, the flat space, supersymmetric theory can be described 
as an ordinary quantum mechanical system, while the theory compactified on an 
n-dimensional torus is described by a field theory in m + 1 space-time dimensions, 
up to n = 3. Quite generally, string theory (gravity) in AdS spaces is described 
by conformal field theories; this is known as the AdS—CFT correspondence. Both 
formulations exhibit what is believed to be a fundamental feature of any quantum 
theory of gravity: holography. The holographic principle asserts that the number of 
degrees of freedom of a quantum theory of gravity grows, not as the volume of the 
system, but as its area. 


28.8.1 Matrix theory 


We have seen that the strong coupling limit of the IIA theory is an eleven- 
dimensional theory, whose low-energy limit is eleven-dimensional supergravity. 
DO-branes were crucial in making the correspondence. The Kaluza—Klein states 
of the eleven-dimensional theory were bound states of DO-branes; states with mo- 
mentum N/R; corresponded to zero-energy (“threshold”) bound states of N DO- 
branes. The world-line theory of N DO-branes is ten-dimensional U (NV) Yang-Mills 
theory reduced to zero dimensions. The action which describes this system is: 


1 oan ol eae talt ak! 
S= fa [o,x'D,x" + 55 MOR eX, KX x) 
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where Rj, is the eleven-dimensional radius, M is the eleven-dimensional Planck 
mass and g = 2R,,. The Xs are the bosonic variables, X;,/ = 1,...,9; Os are 
the fermionic coordinates. It is necessary to impose Gauss’s law as a constraint on 
states. 

Classically and quantum mechanically, this system has a large moduli space, 
corresponding to configurations with commuting X/s. For large X’, the spectrum 
in these directions consists, in the language of quantum mechanics, of 9N free 
particles, and a set of oscillators with frequencies of order |X |. We can integrate out 
the fast degrees of freedom, obtaining an effective action for the low-energy degrees 
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of freedom, the Xs and their superpartners. The bosonic states are just momentum 
states for these particles. They are the states corresponding to the collective modes 
of the D-branes. 

Banks, Fischler, Shenker and Susskind made the bold hypothesis of identify- 
ing these degrees of freedom, and the Lagrangian of Eq. (28.58), as the com- 
plete description of the eleven-dimensional theory, in the limit that N — oo. More 
precisely, the Hamiltonian following from the action of (28.58) is identified with 
the light cone Hamiltonian, and AN is identified with the light cone momentum, 
P+ = N/R. In the large-N limit, this becomes a continuous variable; it is neces- 
sary to take R — o ata suitable rate. The first step in this identification is to note 
that the spectrum of low-lying states of the matrix model is precisely that of the 
light cone supergravity theory. We have already noted that the states are labeled 
by a momentum nine-vector, p. In addition, there are sixteen fermionic variables, 
the partners of the bosons. As in other contexts, we can define eight fermionic 
creation operators and eight fermionic destruction operators. From these we can 
construct a Fock space with 256 states, of which half are space-time bosons (integer 
spin), and half are fermions. This is just the correct number to describe a graviton 
and antisymmetric tensor in eleven dimensions, and their superpartners. The states 
transform correctly under the little group. 

A more convincing piece of evidence comes from studying the S-matrix of the 
matrix theory. Consider, for example, graviton—graviton scattering. Integrating out 
the massive states of the theory gives an action involving derivatives of x. We won’t 
reproduce the detailed calculation here, but the basic behavior is easy to understand. 
One can compute the action from Feynman graphs, just as in field theory. With four 
external Xs, simple power counting gives an action, in coordinate space, behaving 


as: 
: dk v4 
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Here M x |X| = R, the separation of the gravitons. The four factors of v correspond 
to the four derivatives in the graviton—graviton amplitude; 1/R’ is precisely the form 
of the graviton propagator in coordinate space. With a bit more work, one can show 
that one obtains precisely the four-graviton amplitude in eleven dimensions, for 
suitable kinematics. 

M theory compactified on an n-torus is described by ann + 1-dimensional field 
theory. We won’t argue this, but note that in this case the power counting is correct to 
give the right graviton—graviton scattering amplitude. If n > 3, however, the theory 
is non-renormalizable, and the description does not make sense. An alternative 
description can be formulated for dimensions down to six. The matrix model has 
been subjected to a variety of other tests. It turns out that the large-N limit is not 
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necessary; for fixed N, one describes a discretized version of the light cone theory 
(DLCQ). One can actually derive this result, starting with the assumed duality 
between IIA theory and eleven-dimensional supergravity. 

All of this is quite remarkable. Without even postulating the existence of ordi- 
nary space-time, we have uncovered space-time, and general relativity, in a simple 
quantum mechanics model. One interesting feature of these constructions is the 
crucial role played by supersymmetry. Without it, quantum effects would lift the 
flat directions and one would not have space-time — though one would still have a 
sensible quantum system. One might speculate that what we think of as space-time 
is not fundamental, but almost an accident, associated with the dynamics of partic- 
ular systems. Lacking, however, a formulation for a realistic, non-supersymmetric 
system, this remains speculation. 


28.8.2 The AdS/CFT correspondence 


An equally remarkable equivalence arises in the case of string theory on AdS 
spaces. This connection was first conjectured by Maldacena, and is referred to as 
the AdS/CFT correspondence. It asserts that graviton theories in AdS spaces have 
a description in terms of conformal field theories on the boundary. 


A little more general relativity: anti-de Sitter space 
We could construct anti-de Sitter space by solving the Freedman equation with neg- 


ative cosmological constant. Instead, we will adopt a more geometrical viewpoint. 
Starting with a flat p + 3-dimensional space, with metric: 


ptl 
ds? = —dx) — dx>,,, + )° dx; (28.60) 
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we consider the hyperboloid: 
p+l 
xo tx — Do xp SR. (28.61) 
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These coordinates can be parameterized in various ways. For example, one can take 


x9 = Rcosh(p) cos(T), Xp42 = Rcosh(p) sin(t) 
xj = R sinh(p)Q; (i =1,...p+1; Q? = 1): (28.62) 


This automatically satisfies (28.61), and yields the metric: 


ds* = R*(—cosh’ pdt* + dp’ + sinh? pd’). (28.63) 
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In making the correspondence, another parameterization is helpful. These cover 
one half of the hyperboloid 


1 
xo = 5 + wR + —1°)); Xpy2 = Rut 
Uu 
x' = Rux' (i= 1,...p) 
1 > 
peel sil =i (R227 1). (28.64) 
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The metric is then: 
2 2 du? 2 2 =2 
ds“ =R = tu (—dt* + dx*)). (28.65) 
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AdS has interesting features, which we will not fully explore here. There is a 
boundary at spatial infinity (u = oo). Light can reach the boundary in finite time, 
but not massive particles. Ina cosmological context, negative cosmological constant 
leads not to an eternal AdS space but to a singularity. The last form of the metric will 
be useful in making the correspondence in a moment. The metric has isometries 
(symmetries); the group of isometries can be seen from the form of the hyperboloid 
and the underlying metric of the p + 3-dimensional space; itis SO(2, p + 1). This 
turns out to be the same symmetry as conformal symmetry in p + 1 dimensions; 
this, again, is a crucial aspect of the AdS/CFT correspondence. 


Maldacena’s conjecture 


Maldacena originally discovered this connection for the case of string theory on 
AdSs x Ss. One suggestive argument starts by considering a set of N parallel D3- 
branes. We have discussed such configurations as open string configurations, but 
they can also be uncovered as solitonic solutions of the supergravity equations, here 
of the HB theory. For these, the metric has the form: 


ds? = H(y)"'*dx"dx, + H(y)'/? (dy? + y?dQ2) 
Fuvpot = Euvpatad” H. (28.66) 


Here the xs are the coordinates tangent to the branes, while the ys (and their 
associated angles) are the transverse coordinates. The dilaton in this configuration 
is aconstant; the other antisymmetric tensors vanish. The function H, for N parallel 
branes, is 


Art gs(a o 


Boe (28.67) 


HG) = 14 TERY 
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This can be rewritten as: 
2 —1/2 L2 1/2 
ds* = (1 s =) Nuvdx"dx” + (1 + =) (dy* + y7dQ3). (28.68) 


The parameter L is related to the string coupling, g,, the brane charge (number of 
branes) N, and the string tension by: 


L* = 47g,N(a'y’. (28.69) 


It is convenient to introduce a coordinate u = L?/y, and to take a limit where N 
and g, are fixed, while aw’ — 0. The metric then becomes: 


1 du? 
ye ie | Sathdx’ ae — s act (28.70) 


The terms involving u and x we have seen previously; this is the geometry of Ad Ss. 
The remaining terms describe a five-sphere of radius L. 

Now from a string point of view, the low-energy limit of the system of N D3- 
branes is described by N = 4 Yang-Mills theory. So we might, with Maldacena, 
conjecture that there is just such an equivalence. Not surprisingly, demonstrating 
this equivalence is not so simple. One needs to argue that on the string side, the bulk 
modes (graviton, antisymmetric tensors, and so on) decouple, as do the massive 
excitations of the open strings ending on the branes. One cannot argue this at weak 
coupling, and it would be surprising if one could; in that case, one could calculate 
any quantity in the gravity theory in a weak coupling perturbation expansion in 
the Yang—Mills theory. This is similar to the situation in the matrix model. There 
are, however (as in the matrix model), many quantities which are protected by 
supersymmetry, and quite detailed tests are possible, both in this case and for many 
other examples of the correspondence. 


Suggested reading 


Non-perturbative string dualities are discussed extensively in the second volume 
of Polchinski’s (1998) book. This provides an excellent introduction to D-branes. 
D-branes are treated at length in the text by Johnson (2003), as well. The reader may 
want to consult earlier papers on duality, especially Witten (1995). Matrix theory and 
the AdS/CFT correspondence are treated in several excellent pedagogical reviews 
(Bigatti and Susskind, 1997; Aharony et al., 2000; D’ Hoker and Freedman, 2002), 
but the original papers are very instructive; see, for example, Banks ef al. (1997); 
Seiberg (1997), Maldacena (1997), Witten (1998). 
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Exercises 


(1) D-branes: for a stack of N D-branes, write the open string mode expansions. Show 
that, for small separations, the spectrum looks like that of a Higgsed U(N) field theory, 
with Higgs in the adjoint representation. In the light cone gauge, check the counting of 
supersymmetries for open strings and D-branes. 

(2) Verify the construction of the bosonic terms in the ten-dimensional action from the 
dimensional reduction of the eleven-dimensional action. 

(3) Verify that the NS5-brane is a solution of the ten-dimensional supergravity equations. 

(4) Take the long-wavelength limit of the Horava—Witten theory. Write the Lagrangian in 
the ten-dimensional Einstein frame and verify that the gauge and gravitational couplings 
obey the relation appropriate to the heterotic string theory: 


85m = 4K!" (28.71) 


(5) Calculate the effective action of the matrix model at one loop in more detail. Verify 
that treated in Born approximation, this yields the correct graviton—graviton scattering 
matrix element for the eleven-dimensional theory. You may find the background field 
method helpful for this computation. 

(6) Check that the configuration of Eq. (28.66) solves the field equations of IIB supergravity 
in the case of a single brane. You may want to use some of the available programs for 
evaluating the curvature. Verify that, in the Maldacena limit, the metric can be recast as 
in Eq. (28.30). If one requires that the curvature of the AdS space is small, check that 
the D-brane theory is strongly coupled. Discuss the problem of decoupling. 
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Large and warped extra dimensions 


Considerations of the sort we encountered in the previous chapter have inspired 
two approaches to physics beyond the Standard Model: large extra dimensions 
(ADD) and warped spaces (Randall—Sundrum). In this chapter we will provide a 
brief introduction to each. 


29.1 Large extra dimensions: the ADD proposal 


In string theory, it is natural to imagine that the compactification scale is not too 
much different from the Planck scale. The size of the compact space is typically a 
modulus, and if itis stabilized, one might expect it be stabilized at a value not much 
different than one, in string (and therefore Planck) units. In terms of our general 
discussion of moduli stabilization, this is precisely what we would expect: once the 
radius becomes very large, any potential, perturbative or non-perturbative, tends to 
Zero. 

But if we are willing to discard this prejudice, an extraordinary possibility opens 
up. Perhaps the extra dimensions are not Planck size, but much larger, even macro- 
scopic? Arkani-Hamed, Dimopoulos and Dvali realized that from an experimental 
point of view, the limits on the size of such large compact dimensions are sur- 
prisingly weak. Allowing the extra dimensions to be large totally reorients our 
thinking about the nature of couplings and scales in string theory (or any under- 
lying fundamental theory). Such a viewpoint places the hierarchy problem in a 
whole different light, perhaps allowing entirely different solutions than technicolor 
or supersymmetry. 

Branes are crucial to this picture. The observed gauge couplings are small, but 
not extremely small. But in Kaluza—Klein theory and in weakly coupled string 
theories, they are related to the underlying scales in a clear way. For example, in 
the heterotic string: 


gy = a, °MOR®. (29.1) 
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So if gq is fixed, as R > oo, g, > oo. But even in acompactified theory, the gauge 
coupling on D3-branes is insensitive to the large volume. With more general branes, 
one has more intricate possibilities, depending on how the branes wrap the internal 
space. On the other hand, gravity becomes weak as R becomes large: 
1 1 2 

= (29.2) 


2 — 778 pb 8 Rb" 
M2 M8R° @&R 


Gy= 


Now if g, is fixed and of order one, as R — ox, the Planck length tends to zero. 
So how large might we imagine R could be? If we assume that R is macroscopic 

or nearly so, then on distance scales smaller than R, the force of gravity will be that 

appropriate to a higher-dimensional theory. In d space-time dimensions, 


Force, ~ (29.3) 


7d" 
Experiments currently probe possible modifications of the gravitational force law 
on scales of order millimeters or somewhat smaller. (Since the proposal of large 
extra dimensions was put forward, these limits have been significantly improved.) 

If the scale of the large extra dimensions is of order millimeters, how large is 
the fundamental scale? This depends on the number of dimensions that are actually 
large. If there are a large extra dimensions, any others being comparable in size to 
the fundamental scale, 


Mz = MiiGR*, (29.4) 
or 
aks, 
Mima = (M5 R~*)** R= My'(My/Miuna) OF". (29.5) 


A new viewpoint on the hierarchy problem arises by supposing that Muna is 
close to the scale of weak interactions, say 


Meuna ~ 1 TeV. (29.6) 


Then we can use Eq. (29.5) to relate R to the Planck scale and the weak scale. For 
example, if a = 2, R ~ 0.01 cm! For larger a, it is smaller, but still dramatically 
large; for a = 3, for example, it is about 10-7 cm. But a = 1 would be, quite 
literally, astronomical in size, and is clearly ruled out by observations. 

What is quite surprising is that it is difficult to rule out dimensions with size 
of order millimeters. Since the original proposal, there have been several experi- 
ments dedicated to improving the limits on deviations from Newtonian gravity at 
millimeter distances. 

The possibility of large extra dimensions offers a different perspective on the 
hierarchy problem. The weak scale is fundamental; the issue is to understand why 
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the radius of the large dimensions is so large. One possibility which has been 
seriously considered is that there are some very large fluxes. For example, if Hy 
is a two-form associated with a U(1) gauge field, and & is some closed two- 
dimensional surface, we can have: 


/ Hyndx™ A dx =N. (29.7) 
z 
If the radius of the dimensions associated with © is large, then 
N 
H ~ Re (29.8) 


The potential, in turn, receives a contribution behaving as N*/R?. If there is also a 
(positive) cosmological constant, 


V=AR*+ nes (29.9) 
R2 , 


and assuming that A is of order the fundamental scale, 
RONG 4 (29.10) 


To obtain a sufficiently large radius in this way, then, requires an extremely large 
flux. There are some circumstances where such large pure numbers may not be 
required; supersymmetry and low dimensionality (a = 2) help. 

For now, we will assume that somehow a large radius arises for dynamical 
reasons, and consider some of the other questions which, ultimately, such a picture 
raises. 


(1) Proton decay: with no further assumptions about the theory, we would expect that 
baryon number violating operators would arise suppressed only by the TeV scale. It 
would then be necessary to suppress operators of very high dimension. One possible 
resolution of this problem is elaborate discrete symmetries. Another suggestion has 
been that the modes responsible for the different low-energy fermions might be very 
nearly orthogonal. 

(2) Other flavor changing processes: for the same reason, flavor changing processes in 
weak interactions, processes like 4 — e+ y, and the like pose a danger. One possible 
solution is a fundamental scale a few orders of magnitude larger than the weak scale. 
This raises the question of why the weak scale is small — the hierarchy problem again. 
Orthogonality of fermions, again, can help with many of these difficulties. 


We turn, finally, to the phenomenology of large extra dimensions. Here there are 
exciting possibilities. If R is large, the Kaluza—Klein modes are very light. They 
are very weakly coupled, but there are lots of them and little energy is required for 
their production. So consider inclusive production of Kaluza—Klein particles in an 
accelerator. In terms of Gy = «*/8z the amplitude for emission of a Kaluza—Klein 
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particle is proportional to «. For any given mode, then, the cross section behaves as 
0, ~ Gy E”, where the E” factor follows from dimensional analysis. We need to 
sum over n — equivalently, to integrate over a-dimensional phase space. As a crude 
estimate, we can treat the amplitude as constant, and cut off the integration at FE, so 


Orr = R* / d"k o, = GyR°E***, (29.11) 


Recalling that Gy = GgnaR~“*, we see that the tower of Kaluza—Klein particles 
couples like a 4 + a-dimensional particle — i.e. at high energies, the extra dimen- 
sions are manifest! The cross section exhibits exactly the behavior with energy one 
expects in 4 + a dimensions. 

The actual processes which might be observed in accelerators are quite dis- 
tinctive. One would expect to see, for example, production of high-energy photons 
accompanied by missing energy,with the cross section showing a dramatic rise with 
energy. Such signatures have already been used (as of this writing) to set limits on 
such couplings. 

The production of Kaluza—Klein particles in astrophysical environments can be 
used to set limits on extra dimensions as well. For example, in the case of two large 
dimensions and fundamental scale of order | TeV, we saw that the scale of the 
Kaluza—Klein excitations — the inverse of the radius of the extra dimensions — is of 
order 10~!* GeV, so such particles are easy to produce. Like axions, they could be 
readily produced in stars. 


29.2 Warped spaces: the Randall-Sundrum proposal 


Having entertained the possibility that some compact dimensions of space might 
be very large, one might wonder why the extra dimensions should be flat. The 
Horava—Witten theory provides a model. Taking the formulas of this theory literally, 
we have seen that if this theory describes nature, the eleventh dimension is quite 
large in fundamental units. The metric of this dimension is significantly distorted; 
we might say that it is warped. This is not surprising. The geometry is essentially 
one-dimensional. Green’s functions for the fields grow linearly with distance. One 
of the appealing features of the Horava—Witten proposal is that the dimensions are 
just large enough that the distortion of the geometry is of order one. 

Randall and Sundrum have made a more radical proposal: they argue that the 
warping might be enormous, and might account for the large hierarchy between 
the weak scale and the Planck scale. In the simplest version of their model, there is 
again one extra dimension; call its coordinate ¢, 0 < ¢@ < a. The model contains 
two branes, one at ¢@ = 0, one at @ = z. The tensions of the two branes are taken 
to be equal and opposite. One imagines that the Standard Model fields propagate 
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on one brane, the “visible sector” brane, while some other, hidden sector fields 
propagate on the other. The action is: 


S = Sgray + Svis + Shia- (29.12) 
The bulk gravitational action, Sg;ay, includes a cosmological constant term: 
Soray = : d*x / dp/—G[—A +2M?R], (29.13) 
where M is the five-dimensional Planck mass. The brane actions are: 
Svis = / d*x/—BvisLLvis — Avis] Shia = i d*x/—8nialLnia — Ania]. (29.14) 


Here we have separated off a brane tension term on each brane; we have also 
distinguished the bulk five-dimensional metric, Gyw, from the metrics on each of 
the branes, g,,,. This has the structure of a gravitational problem in five dimensions, 
with 6-function sources at @ = 0, z. Einstein’s equations are: 


1 1 ; 
V =G(Ruw- ;Guvk) = Tap [Av —GGyntAvisv —8visS yoy ONOP—T) 


+ Mriav= Bride 55,50). (29.15) 
Now one makes an ansatz for the metric, which leads to warping: 
ds* = en dx" dx” + r2dg’. (29.16) 
Plugging in the ansatz, Eq. (29.16), one obtains equations for o: 
/ ” 
~ = a - = = ate) + ie — 1m). (29.17) 
This is solved by: 
A 
o =7r-|¢| — 54m (29.18) 


provided that the following conditions on the As hold: 
Ania = Avis = 24M7k A = —24M7K?. (29.19) 


In this case, the metric varies exponentially rapidly. Note that r, does not need to 
be terribly large in order that one obtain an enormous hierarchy. One might worry, 
though, about the identification of the graviton. It turns out that the metric has zero 
modes: 


ds* = e™*l6l[n + Ayy(x)dx"dx” + T?(x)d¢"], (29.20) 
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where T° represents a variation of r,, and is usually referred to as the radion, and 


hy, is the four-dimensional metric. If one substitutes in the action, one finds: 


S= i d*x / do2M?r,e77*"l#|, /—BR. (29.21) 


From this we can read off the effective Planck mass: 
M3 
M; = M?r, i doe rll — = [1-e *]. (29.22) 


So the four-dimensional Planck scale is comparable to the fundamental five- 
dimensional scale. 

To see that the physical masses on the visible brane are small, consider the visible 
sector action for a scalar particle: 


Svis = / dbx =e | B,e™™|DiGl? — A(I6P — v4) |. (29.23) 


Rescaling ¢ > e! 


Six = f dtx/=B [BuslD.oP —a(loP —eAG)"], 29.28 


so the scale is indeed exponentially smaller than the scale on the other brane. 
There are many questions one can ask about this structure. 


™ ob, we have: 


(1) How robust is this sort of localization of gravity? 

(1) How do higher excitations, e.g. bulk fields, interact with the fields on the brane? Is the 
hierarchy stable? (The answer is yes.) 

(3) Does this sort of warping arise in string theory? Again, the answer is yes, though the 
details look different. 

(4) As in the case of large extra dimensions, if this picture makes sense, there are many 
excitations on the branes. Higher-dimension operators are suppressed only by the TeV 
scale. As there, one has to ask: how does one understand conservation of baryon number? 
Other flavor changing processes? Neutrino masses? Precision electroweak physics? 
Answers have been put forward to all of these questions, but they remain suitable 
subjects for research. 

(5) Assuming the above problems are resolved, what are the experimental signals for such 
warping? As in the case of large extra dimensions, one wants to focus on the additional 
degrees of freedom associated with bulk fields and the brane. In this case, unlike the 
case of large extra dimensions, the Kaluza—Klein states are not dense. Instead, the low- 
lying states have masses and spacings of order the TeV scale. Their couplings are not 
of gravitational strength, but instead scaled by inverse powers of the scale of the visible 
sector brane. 


Finally, there are other variants of the Randall—Sundrum proposal which have 
been put forward. Perhaps the most interesting is one in which space is not 
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compactified at all, but simply warped, with gravity localized on the visible brane. 
These ideas suggest a rich set of possibilities for what might underlie a quantum 
theory of gravity. Some of these features — the exponential warping of the metric, 
in particular — have been observed in string theory, but many, at least to date, have 
not. This is a potentially important area for further research. 


Suggested reading 


The original paper of Arkani-Hamed et al. (1999) is quite clear and comprehensive, 
as is the paper of Randall and Sundrum (1999). The phenomenology of the Randall— 
Sundrum models is explored by Davoudias! et al. (2000). 


Exercise 


(1) Verify the Randall—Sundrum solution of Eq. (29.15). 


30 
Coda: Where are We Headed? 


As this book is being completed, the Large Hadron Collider (LHC) at CERN, and 
its two large detectors, ATLAS and CMS, are nearing completion. The center of 
mass energy at this machine will be large, about 14 TeV. The center of mass energies 
of the partons — the quarks and gluons — within the colliding protons will be larger 
than 1 TeV. The luminosity will also be very large. As a result, if almost any of the 
ideas we have described for understanding the hierarchy problem in Part 1 of this 
book are correct, evidence should appear within a few years. For example, if the 
hypothesis of low-energy supersymmetry is correct, we should see events with large 
amounts of missing energy, and signatures such as multiple leptons. Large extra 
dimensions should be associated with rapid growth of cross sections for various 
processes, again with missing energy; the warped spaces suggested by Randall 
and Sundrum should be associated with the appearance of massive resonances. 
Technicolor, similarly, should lead to broad resonances. Assuming some under- 
lying technicolor model can satisfy constraints from flavor physics and precision 
electroweak measurements, one might expect to find some number of light (com- 
pared with 1 TeV), pseudo-Goldstone bosons, many with gauge quantum numbers. 
If any of these phenomena occur, distinguishing among them in the complicated 
environment of a hadron machine will be challenging. It is conceivable that there 
will be competing explanations, and that choosing between them will require a 
very high-energy electron—positron colliding beam machine. Such a machine is 
under consideration by a consortium of nations, and is referred to as the Interna- 
tional Linear Collider, or ILC. Hopefully in later editions of this book, it will be 
possible to focus on real experimental results, rather than a range of theoretical 
speculation. 

Before the data rolls in, we might hope to select among these possibilities, or 
perhaps discover some crucial idea — and possible set of phenomena — that we 
are missing. Here, string theory might help. Many of the ideas for Beyond the 
Standard Model Physics require phenomena which can only be understood within 
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a quantum theory of gravity. This is certainly true of large extra dimensions or 
Randall—Sundrum. 

We have discussed many aspects of string theory, but as far as the world about 
us and the experiments which explore it, we have left things in a rather unsettled 
state. We have seen that string theories have many of the features we might hope 
for from nature. We have exhibited ground states — more precisely, approximate 
moduli spaces — with many of the features of the Standard Model: the observed 
gauge groups, repetitive generations of quarks and leptons, calculable gauge and 
Yukawa couplings, and more. But it is not clear how to make sharp predictions. 
There are vast numbers of moduli spaces with the wrong features: the wrong num- 
ber of dimensions, too much supersymmetry, the wrong gauge group and matter 
content, and we have not offered a dynamical mechanism or principle which might 
select among them. Not only are there discrete choices, but there are continuous 
ones as well, associated with the moduli. We have seen how potentials for the 
moduli arise, but we have not offered any idea for how stable or metastable vacua 
might arise, other than to note that such states will typically lie at strong coupling, 
where they are inaccessible to analysis. Note that our discussion of strong—weak 
coupling duality, by itself, does not help with this problem; our general argu- 
ments show that one cannot find stable vacua at arbitrarily weak coupling in any 
description. 

For a long time, string theorists hoped for some deus ex machina which might 
resolve this conundrum. Some have imagined that one would simply find some new 
type of string model or construction which would not suffer from these difficulties, 
perhaps leading uniquely to the Standard Model at low energies. Developments in 
string duality have suggested an alternative picture: there might exist a vast number 
of isolated, stable or metastable states of the theory, with little or no supersymmetry. 
These states seem to have a distribution of values of couplings, mass scales and 
cosmological constant. There may be an exponentially large or even infinite number 
of them. 

To understand how these come about, we return, again, to the IIB theories. In 
these theories, there are two three-form gauge fields, one arising from the NS—NS 
sector, the other in the R-R sector; denote these H and F. If one compactifies on 
a Calabi—Yau space, it is possible to have non-trivial backgrounds for these fields. 
One can define three-form fluxes by integrating these over non-trivial three cycles. 
The number of such cycles, we have learned, is h2,,. The fluxes are quantized, by 
an argument identical to Dirac’s. There are also constraints on the values of fluxes, 
resulting from absence of anomalies (violation of Gauss’s law). In the presence of 
fluxes, we might expect the system to have a non-trivial energy; this energy is a 
function of the values of the moduli, so there is a potential for the moduli. This 
potential can be described in terms of a superpotential. It turns out that there is a 
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simple formula for the superpotential: 
w= [Gna (30.1) 


where G3 = F3 — t H3 and wis the covariantly constant three-form. One can obtain 
explicit formulas in the case of compactification of the IIB theory on a torus. This 
expression depends on the complex structure moduli, z, but not on the Kahler 
moduli, p. 

The equations D,W = 0 generically fix the complex structure moduli and the 
dilaton but not the Kahler moduli. But the superpotential of Eq. (30.1) is not exact, 
and can receive non-perturbative corrections. These can lead to a potential for the 
Kahler moduli. Logically, we can organize the analysis by first integrating out the 
complex structure moduli and the dilaton, leaving a superpotential for the Kahler 
modulus, p, of the form: 


W,=Wote®. (30.2) 


The Kahler potential, on the other hand, for large p and weak coupling should not 
be too different than its tree-level form: 


K =—3In(p + 0°). (30.3) 


If Wo is small, it is not hard to see that the potential has a supersymmetric stationary 
point at 


p &* —c|n(Wo). (30.4) 


But why should Wo be small? The answer is that, in general, it isn’t, but there are 
many possible choices of flux, and so there are many different possible states with 
different values of Wo. In this construction, the number of possible flux choices 
which permit a supersymmetric low-energy theory for ¢ is finite but extremely 
large. One can understand this by thinking « of a b-dimensional vector of integers, 
N, representing the fluxes, constrained by |N |> < L?. The number of possible flux 
choices is then the volume of a b-sphere of radius L, or of order 


N &L?. (30.5) 


In interesting Calabi-Yau compactifications, L and b can be of order 100s. 

If we suppose that Wo is distributed more or less randomly among these states, 
we might expect that the probability of finding Wo at small Wo would be roughly 
uniform with Wo as a complex variable, i.e. f d* Wo P(Wo), with P(Wo) constant. 
Numerical studies suggest that this is indeed the case. 

Other quantities will also vary across the landscape. One can get gauge groups, 
for example, by including branes in these configurations. There will be, then, a 
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distribution of groups and couplings. Perhaps not surprisingly, the distribution of 
gauge couplings, at small gauge coupling, is typically flat in g?. One might expect 
to find supersymmetry broken dynamically or through brane configurations. There 
is also a vast array of states — perhaps infinite, where supersymmetry is broken 
already by the classical superpotential for the complex structure moduli, i.e. one 
cannot solve the set of equations D,W = 0. 

This is compatible with many of our ideas for thinking about the hierarchy 
problem. In supersymmetry, technicolor, and the Randall—Sundrum approaches, 
one argues that it is reasonable to have exponentially small scales, because it is 
reasonable to think that couplings should be small, but not much smaller than one. 
Why this should be so is unclear. If there is a unique underlying theory, for example, 
the couplings are whatever they are. But within this landscape, there is a distribution, 
which would seem to motivate just the sorts of ideas we have explored. 

Much work is going into mapping the features of this landscape. Many types of 
states have been explored and their statistics at least partially understood, including 
states with and without supersymmetry, with various gauge groups, and the like. 
From these studies, it seems plausible that there are some — possibly many — states 
in the landscape with the low-energy features of the Standard Model. This naturally 
raises several related questions. 


(1) Are typical states with the features of the Standard Model characterized by things we 
might hope to measure, such as large extra dimensions or supersymmetry? 
(2) Why do we find ourselves in a state which looks like ours does? 


If one can answer these questions, one might hope to make real predictions. It 
is, at this writing, too early to answer the first, though one can enumerate some of 
the issues. For example, if there are comparable numbers of supersymmetric and 
non-supersymmetric states, it might be that there are more supersymmetric states 
with a large hierarchy of states than non-supersymmetric ones. If there are vastly 
more non-supersymmetric states, it might be that technicolor or warping are the 
most common ways of obtaining large hierarchies. 

If the landscape really exists with the sorts of properties we are attributing to 
it, it seems likely that the answer to the second question is, in part, environmental. 
The issue is posed most starkly by the cosmological constant. The distribution of 
cosmological constants is not likely to peak at the very tiny but non-zero observed 
value. On the other hand, Weinberg, and also Banks, pointed out long ago that 
if the cosmological constant were much larger than observed (perhaps a factor 
of 10), galaxies would not form. The problem is that the exponential expansion 
of the universe would begin before the perturbations from inflation became non- 
linear. Given that structure formed when the universe was about 1/20 of its present 
age, a cosmological constant far larger than the present energy density would be 


Suggested reading 479 


problematic. This argument predicted a cosmological constant in the observed 
range. 

The idea that some quantities are determined by environmental considerations, 
or anthropic conditions, is quite controversial. At one level, one can ask what would 
happen if several quantities, such as the parameters of the inflationary potential, 
were allowed to vary. At another, this argument requires that the universe, somehow 
in its history, sampled all of the different possible states. Within present ideas about 
inflation, it is not clear whether this is possible. Finally, even granting the first two 
points, this argument rests on the requirement of observers to determine at least 
some of the laws of nature. Some physicists argue that this is no different than 
saying that we find ourselves on a planet at a certain distance from a star because, 
otherwise, the planet could not support life; they point out that through much 
of human history, the Earth-Sun distance was viewed as fundamental. But many 
physicists find this troubling. At this writing, however, it is fair to say that this is 
by far the most plausible explanation we have of the value of A. 

Hopefully, the reader who has worked patiently through this book, has been left 
with challenging questions, and the skills to attack them. As the LHC turns on, as 
further progress is made in understanding how quantum gravity might be reconciled 
with much lower-energy physics, we may be at the threshold of understanding what 
lies beyond the Standard Model. 


Suggested reading 


Weinberg’s argument for the cosmological constant appears in his 1989 paper. 
The landscape picture we have described here was first put forward by Bousso 
and Polchinski (2000) and Feng et al. (2001), and formulated most sharply in 
string theory by Kachru ef al. (2003). Some statistical studies, which significantly 
elucidate the possible meaning of the landscape, appear in Ashok and Douglas 
(2004). 


Part 4 
The appendices 


Appendix A 


Two-component spinors 


The massless Dirac equation simplifies dramatically in the case that the fermion mass is 
zero. The equation 


Dy =0 (Al) 
has the feature that if y is as solution, so is ys: 
D(ys) = 0. (A2) 
The matrices 
P,= xl + ys) (A3) 
are projectors, 
P2 =P, P,P_=P_P,=0. (A4) 


To understand the physical significance of these projectors, it is convenient to use a 
particular basis for the Dirac matrices, often called the chiral or Wey] basis: 


0 of 
wh _ 
yt = & A ) ; (AS) 
where 
o"=(1,o) 6" =(1,-<c). (A6) 
In this basis, 
F -1 0 
ys =iy’yl yy? = ( 0 Of (A7) 


so 


Fa (5 i r= (5 a) (A8) 


We will adopt some notation, following the text by Wess and Bagger: 


y= ( ) (A9) 
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Correspondingly, we label the indices on the matrices 0“ and a" as 


ot aot gh = GHP, (A10) 


aa 


This allows us to match upstairs and downstairs indices, and will prove quite useful. The 
Dirac equation now becomes: 


io} 3.0" =0 ia”**d,.x. = 0. (All) 


Note that x and ¢* are equivalent representations of the Lorentz group; x and ¢ obey 
identical equations. We may proceed by complex conjugating the second equation in 
Eq. (A11), and noting o.0"*02 = a". 

Before discussing this identification in terms of representations of the Lorentz group, it 
is helpful to introduce some further notation. First, we define complex conjugation to 
change dotted to undotted indices. So, for example, 


ge = (9%). (Al2) 
Then we define the antisymmetric matrices €gg and €“? by: 
eta ta—e* eyg = —e?, (A13) 


The matrices with dotted indices are defined identically. Note that, with upstairs indices, 
€ =10), ape” = 6). We can use these matrices to raise and lower indices on spinors. 
Define dy = Eup’, and similar for dotted indices. So 


$a = €ap("?)*. (Al4) 
Finally, we will define complex conjugation of a product of spinors to invert the order of 
factors, so, for example, (Xu¢g)* = o, Xe 
With this in hand, the reader should check that the action for our original 
four-component spinor is: 


S= [asc = ie (i xg Oy Xu + 1 Of, 9,0") 


= [ave = ie (ix Oh Oux™* + ip oh, 0.0") . (A15) 


At the level of Lorentz-invariant Lagrangians or equations of motion, there is only one 
irreducible representation of the Lorentz algebra for massless fermions. 

Two-component fermions have definite helicity. For a single-particle state with 
momentum p = p2, the Dirac equation reads: 


pU +oz)¢ = 0. (A16) 


Similarly, the reader should check that the antiparticle has the opposite helicity. 
It is instructive to describe quantum electrodynamics with a massive electron in 


two-component language. Write 
e 
pole), (A17) 


In the Lagrangian, we need to replace 0,, with the covariant derivative, D,,. Note that e 
contains annihilation operators for the left-handed electron, and creation operators for the 
corresponding antiparticle. Note also that é contains annihilation operators for a particle 
with the opposite helicity and charge of e, and é*, and creation operators for the 
corresponding antiparticle. 
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The mass term, my, becomes: 
mw = mee, + mere*, (A18) 


Again, note that both terms preserve electric charge. Note also that the equations of 
motion now couple e and é. 
It is helpful to introduce one last piece of notation. Call 


WX = Wo xa = Wax” = xX" Wa = XW. (A19) 
Similarly, 
We = Wax = Ws = XG = XO. (A20) 
Finally, note that with these definitions, 


(xv) = x*v*. (A21) 


Appendix B 


Goldstone’s theorem and the pi mesons 


It is easy to prove Goldstone’s theorem for theories with fundamental scalar fields. But the 
theorem is more general, and some of its most interesting applications are in theories 
without fundamental scalars. We can illustrate this with QCD. In the limit that there are 
two massless quarks (i.e. in the limit that we neglect the mass of the wu and d quarks), we 
can write the QCD Lagrangian in terms of spinors 


we (3) (BL) 


= 1 
L= Wiy" Dub — 7 Fi: (B2) 


as 


This Lagrangian has symmetries: 


a aad 


Poi Tw ws ry (B3) 


(t“ are the Pauli matrices). In the limit that two quarks are massless, QCD is thus said to 
have the symmetry SU(2), x SU(2)p. 
So writing a general four-component fermion as 


q= . (B4) 
the Lagrangian has the form: 
L=iVo"'D,WV* +iVo"D, oe. (B5) 
In this form, we have two separate symmetries: 
wo itrw Ps lktyh, (B6) 
Written in this way, it is clear why the symmetry is called SU(2), x SU(2)p. 


Now it is believed that in QCD, the operator WW has a non-zero vacuum expectation 
value, i.e. 


(WW) & (0.3GeV)°5 fp. (B7) 
This is in four-component language; in two-component language this becomes: 


(Wp Wp + Wee) £0. (B8) 
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This leaves ordinary isospin, the transformation, in four-component language, without the 
Ys, Or, in two-component language, with w, = —wR, unbroken. 

But there are three broken symmetries. Correspondingly, we expect that there are three 
Goldstone bosons. To prove this, write 


O=UW; O'= byw. (B9) 
Under an infinitesimal transformation, 
60 = 2iw*O%; 50% =iw"O. (B10) 
In the quantum theory, these becomes the commutation relations: 
[0", O] = 210°; [0%, 0°] = is”O. (B11) 


Q* is the integral of the time component of a current. To see that there must be a massless 
particle, we study 


0= "i d* xd, L(Q\T (j44(x)O?O))|Q)e—?*] (B12) 


(this just follows because the integral of a total derivative is zero). We can evaluate the 
right-hand side, carefully writing out the time-ordered product in terms of 0 functions, and 
noting that dp on the @ functions gives 6-functions: 


0= if d*x(Q\L J" (x), O? (0)15(x°)|Q)e7?* — ip, x i d*x(Q\T (j"“(x)O? (0))|Q). 


(B13) 
Now consider the limit p” = 0. The first term on the right-hand side becomes the matrix 
element of [Q%, O°(0)] = O(0). This is non-zero. The second term must be singular, then, 
if the equation is to hold. This singularity, as we will now show, requires the presence of a 
massless particle. For this we use the spectral representation of Green’s function. In 
general, a pole can arise at zero momentum only from a massless particle. To understand 
this singularity we introduce a complete set of states, and, say for x° > 0, write it as 


dp 
Qj" (x) Ap) (Ap|O?(0)|Q). B14 
a a [iA p)AplO’O|2) (B14) 
In the sum, we can separate the term from the massless particle. Call this particle 2”. On 
Lorentz-invariance grounds, 


(Qj |x?(p)) = fr phd”. (B15) 
Call 
(Ag |O“(x)|0°(p)) = ZI, eo! (B16) 
Considering the other time ordering, we obtain for the left-hand side a massless scalar 


propagator, i/p”, multiplied by Zf, p“, so the equation is now consistent: 


p- 
(Pw) = Sin Z. (B17) 
P 


It is easy to see that in this form, Goldstone’s theorem generalizes to any theory without 
fundamental scalars in which a global symmetry is spontaneously broken. 

Returning to QCD, what about the fact that the quarks are massive? The quark mass 
terms break explicitly the symmetries. But if these masses are small, we should be able to 
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think of the potential as “tilted.” This gives rise to a small mass for the pions. We can 
compute these by studying, again, correlation functions of derivatives of currents. A 
simpler procedure is to consider the symmetry-breaking terms in the Lagrangian: 


Lo = UMY, (B18) 


where M is the quark mass matrix, 


m, O 
m=( ay (B19) 


Since the 2 mesons are, by assumption, light, we can focus on these. If we have a 
non-zero pion field, we can think of the fermions as being given by: 


Wada’ TW, (B20) 


In other words, the pion fields are like symmetry transformations of the vacuum (and 
everything else). 

Now assume that there is an “effective interaction” for the pions containing kinetic terms 
(1/2)(0,,7“)*. Taking the form above for W, the pions get a potential from the fermion 
mass terms. To work out this potential, one plugs this form for the fermions into the 
Lagrangian and replaces the fermions by their vacuum expectation value. This gives that 


V(r) = (Gq) Tr(e'"™" M), (B21) 
one can expand to second order in the pion fields, obtaining: 
m= fz, = (my + m4)(4q). (B22) 
Exercises 


(1) Verify Eq. (B13). 
(2) Derive Eq. (B22), known as the Gell-Mann—Oakes—Renner formula. 


Appendix C 
Some practice with the path integral in field theory 


The path integral is extremely useful, both in field theory and in string theory. This 
appendix provides a brief review of path integration, and some applications. Many of the 
examples are drawn from finite-temperature field theory. These are instructive since one 
can easily write very explicit expressions. They are also useful for understanding the 
high-temperature universe, and are closely connected to computations which arise in 
compactified theories. 


C.1 Path integral review 


Feynman gave an alternative formulation of quantum mechanics, in which one calculates 


amplitudes by summing over possible trajectories of a system, weighting by e'S/”, where 
S is the classical action of the trajectory. For a particle, the path integral is: 
Z= fraxtes™ (C1) 


Here [dx] is an instruction to sum over all possible paths of the particle. 
This generalizes immediately to field theory, where surprisingly it is often more useful: 


Z= / [d@le'®. (C2) 
For a single field, ¢, it is useful to introduce sources, J(x), and to define 
Zit= pagi Te a0e eo, (C3) 


Green’s functions of ¢ can then be obtained by functional differentiation of Z with respect 
to J: 


é 6 
ee Z 
idJ (x1) idJ (Xn) 


For free fields, the integral can be performed by completing the squares. Writing the 
action as: 


T ($1) ---PGn)) = [J]. (C4) 


Stree = fas (50090-'600 Ba 60.400)) (CS) 
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with 
D'=0 —m = p?—m’. (C6) 


We can complete the squares in the action: 
1 
Stree = / d*x (50 + i ay JQ)D.%9) De (e+ i a'z1@)D.)) 


- / d'xd4yJ(x)D(x, y)J(y). (C7) 


Now in the free field functional integral, one can shift the @ integral, obtaining: 
ZolJ] = Ae? [Ex4yI@DEYIO) (C8) 


Here A is the free field functional integral at J = 0. It is the (square root) of the functional 
determinant of the operator D. D itself is the propagator of the scalar. This expression can 
then be used to develop perturbation theory. For example, with a (A/4!)@* interaction, we 


can write: 


Working out the terms in the power series reproduces precisely the Feynman diagram 
expansion. 

This has generalizations to non-Abelian gauge theories, with both unbroken and broken 
symmetries, which we will discuss in the text. We will also find it useful for addressing 
other questions. 


C.2 Finite-temperature field theory 


As an application of path integral methods and because of its importance in cosmology, 
we consider at some length the problem of field theory at finite temperature. 
In statistical mechanics, one is interested in the partition function, 


Z[B] = Tre ?*. (C10) 


For a quantum mechanical system, in contact with a heat bath, this is: 


Z(B1= >i (nleP™"In), (C11) 


n 


where n label the energy eigenstates. 
For a harmonic oscillator (unit mass), H = [(p?/2) + (w?/2)]x?, and the partition 
function is: 


e fF = - eo holnt 3) 
n 


1 


— 9 @B/2 
> 1 — eho” 


(C12) 


Now we can think of 


(x|e7P# |x) (C13) 
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as the amplitude that starting at x one ends up at x after propagating through an imaginary 
time —i6. This can be represented as a path integral: 


(xleP" |x) = / [dx]e atte, (C14) 

(0)=x(B)=x 

where Lg is the Euclidean Lagrangian, 

dx\* 1 
Lp = (=) fe 50% (C15) 

(note the signs here!). The partition function is now 

dxo B 
zip\= | [dxJe te, C16) 
x(0)=x(B)=xo 


i.e. we integrate over the possible values of x at t = 0 in order to take the trace. This is the 
problem of a periodic box in the time direction. For this simple system with one degree of 
freedom, we can write: 


x(t) = UF ane ar (C17) 


We will simplify the problem slightly by taking x(t) to be complex (you can think of this 
simply as an isotropic harmonic oscillator in two dimensions). The action of this 
configuration is: 


[o.e) 
1 
i= 2 5 (on + 0) |ay/°. (C18) 
The path integral is now: 
Z[B1 = I] / da,da*e~*., (C19) 
The integrals are just Gaussian integrals. For a complex variable, z, we have 
i dze-aer = 2 (C20) 
a 
so we have the result for Z: 
1 
Zip1= |] ee (C21) 


where w, = (27n)/T. 

Now before trying to evaluate this product, it is useful to pause and note that this can be 
expressed in terms of the determinant of a matrix. Quite generally, Gaussian path integrals 
take the form of (inverse) determinants. In this case, if we call M the differential operator: 


2 
Mes (-= + o*) (C22) 


its eigenfunctions are just e’®"’, with eigenvalues w? + wm’. So Z is just the inverse 
determinant of M. Had we worked with only one real coordinate, we would have 
obtained the square root of the inverse determinant. 
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The determinant of an infinite matrix may seem a daunting object, but there are some 
tricks that permit evaluation in many cases. The first thing is to write the determinant as a 
sum, by taking a logarithm. In general, 


det M = e 1) (C23) 


(to see this, diagonalize M). It is easier to evaluate derivatives of the determinant rather 
than the determinant itself. We can derive a very useful formula for the derivative of a 
determinant by writing: 


det(M +5M) =e TrIn(M+8M) __ ,(TrIn(M)+In(1+M~'5M)) 
= ¢ TIM), TM'SM ~ det M(1 + TrM~'5M). (C24) 


Dividing by 6M gives the derivative. 
In our case, it is convenient to study: 


ld 1 
=—Z= ai C25 

Z dw 2 ow + 02 oy 
This is progress. Our infinite product is now an infinite sum. The question is: how do we 
do the sum? The trick is to look for a periodic function which is well-behaved at infinity, 


but has poles at the integers. A suitable choice is 


1 
eA: (C26) 
We can then replace any sum of the form, >> f (7), by a contour integral, 
1 1 


Here the contour is a line running just above the real z axis and back just below. The 
residues of the (infinite number of) poles just give back the original sum. 
Now one can deform the contour, taking one line into the upper half plane, one into the 


lower, picking up the poles at z = +iw. This leaves us with: 
dF _ 1 1 1 (C28) 
dwt \eB — 1 eB — 1} 2W’ 
We could analyze this problem further, but let us jump instead to free field theory. Then 
Z[p] = i, [dd]ew FU Gna +m°g"], (C29) 
$(B)=(0) 
In a finite box, with periodic boundary conditions, we can expand: 
OG, 1) = Do ebb Hon ge (C30) 
km 
where @, = 2amT. 
In this form, we have that 
Z[B] = det(—d? + m?)"!?. (C31) 


Again, this is somewhat awkward to work with. It is easier to differentiate: 


1 dZ 


1 ; 1 
Zane z | tao! vee f ateseC) (C32) 
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This is just the propagator, with periodic boundary conditions in the time direction: 


[ foo) = BV (~O)P0)). (C33) 


The propagator is given by: 


(6OGO) = D> See (C34) 


m 


We can convert this into a more sds form by means of the same trick. The 
propagator is given by the expression below: 


Tar a es ee dz I Gas 
(p( P( )) = (272)3 on / eizB —1 (QanT 2 4+ 2 4+ m2 ( ) 


Now deform the contour as before, picking up the poles at tiV k2 + m2. Both poles 
make the same contribution, yielding: 


1 1 1 
2Vk2 + m? Es Btm 1 efa/P+m? _ | 


1 2 
— i . (C36) 
QJ k2 + m2 eB +m? _ ; 


Note the appearance of the Bose-Einstein factors here. Note also the first term has the 
structure of the zero temperature expression for the energy; the second is the finite 
temperature expression. This is what we find differentiating: 


a’k Ly ; ae 

BF=V E,t+ Bb In